skip to main content
article
Free Access

Partition based spatial-merge join

Authors Info & Claims
Published:01 June 1996Publication History
Skip Abstract Section

Abstract

This paper describes PBSM (Partition Based Spatial-Merge), a new algorithm for performing spatial join operation. This algorithm is especially effective when neither of the inputs to the join have an index on the joining attribute. Such a situation could arise if both inputs to the join are intermediate results in a complex query, or in a parallel environment where the inputs must be dynamically redistributed. The PBSM algorithm partitions the inputs into manageable chunks, and joins them using a computational geometry based plane-sweeping technique. This paper also presents a performance study comparing the the traditional indexed nested loops join algorithm, a spatial join algorithm based on joining spatial indices, and the PBSM algorithm. These comparisons are based on complete implementations of these algorithms in Paradise, a database system for handling GIS applications. Using real data sets, the performance study examines the behavior of these spatial join algorithms in a variety of situations, including the cases when both, one, or none of the inputs to the join have an suitable index. The study also examines the effect of clustering the join inputs on the performance of these join algorithms. The performance comparisons demonstrates the feasibility, and applicability of the PBSM join algorithm.

References

  1. Arc95 ESRI, Redlands, CA. "ARC/INFO: The World's GIS. An ESRI White Paper", March 1995.Google ScholarGoogle Scholar
  2. Ben75 J. L. Bentley. "Multidimensional Binary Search Trees Used for Associative Searching". In Communication ofthe A CM, volume 18(9), September 1975. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ben79 J. L. Bentley. "Multidlmensionat Binary Search Trees in Database Applications". In IEEE Trat~sactions on Software Engineering, volume 5(4), 1979.Google ScholarGoogle Scholar
  4. BHF93 L. Becker, K. Hinnchs, and U. Finke. "'A New Algorithm for Computing Joans With Grid Files". In IEEE Transactions on Knowledge and Dam Engineering, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. BKS93 T. Brinkhoff, H. P. Kriegel, and B. Seeger. "Efficient Processing of Spatial Joins Using R-trees". In Proceedings of the 1993 A CM-SIGMOD Conference, Washington, DC, May 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. BKSS90 N. Beckmann, H. R Kriegel, R. Schneider, and B. Seeger. "'The R*-tree: An Efficient and Robust Access Method for Points and Rectangles". in Proceedbtgs of the 1990 A CM-SIGMOD Conference, June 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. BKSS94 T. Brinkhoff, H. R Kriegel, R. Schneider, and B. Seeger. "'Multi-step Processing of Spatial Joins". In Proceedings of the 1994 A CM-SIGMOD Colference, Minneapolis, May 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Bur86 R A. Burrough. "Principles of Geographic hzformation Systems for Land Resources Assessment". Oxford University Press, 1986.Google ScholarGoogle Scholar
  9. CDF+94 M. J. Carey, D. J. DeWitt, M. J. Franklin, N. E. Hail, M. McAuliffe, J. F. Naughton, D. T. Schuh, M. H. Solomon, C. K. Tan, O. Tsatatos, S. White, and M. J. Zwilling. "Shoring up Persistent Applications". In P~vceedings of the 1994 ACM- SIGMOD Conference, "Minneapolis, Minnesota", May 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. CFR87 T. Sellis C. Faloutsos and N. Roussopoulos. "Analysis of Object Oriented Spatial Access Methods". In Proceedings of the 1987 A CM-SIGMOD Conferelzce, San Francisco, May 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Cor95 Intergraph Corporation. "GIS/AM/FM Information". "http://wwm intergraph'c~m/utihnap'shtmt ", 1995.Google ScholarGoogle Scholar
  12. DKL+94 D. J. DeWitt, N. Kabra, J. Luo, J. M. Patel, and J, Yu. "Claent-Server Paradise". In Proceedings of the 20th VLDB Colf, Santiago, Chile, September 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. DLPY93 D. J. DeWitt, J. Luo, J. M. PateI, and J. Yu. "Paradise -A Parallel Geographic Information System". In Proceedings of the A CM Workshop on Advance~ in Geographic Information Systems, Arlington, Virginia, November 1993.Google ScholarGoogle Scholar
  14. DNSS92 D. J. DeWitt, J. R Naughton, D. A. Schneider, and S. Seshadri. "'Practical Skew Handling in Parallel Joins". In Proceedbzgs of the 19th VLDB Coil, August 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. GS87 R. H. Gtiting and W. Shilling. "A Practical Diwde-and- Conquer Algorithm for the Rectangle Intersection Problem". In bzformation Sciences, volume 42, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Gün93 O. G~nther. "Efficient Computation of Spatial Joins". In IEEE Transactions on Knowledge and Data E1zgineering, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Gut84 A. Gutman. "R-trees: A Dynamic Index Structure for Spatial Searching". In Proceedings of the 1984 A CM-SIGMOD Conference, Boston, Mass, June 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. HNKT90 L. Harada, M. Nakano, M. Kitsuregawa, and M. Takagi. "Query Processing Methods for Multi-Attribute Clustered Relations". In Proceedings of the 16th VLDB Conf, Brisbane, Australia, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. HS95 E. G. Hoel and H. Samet. "Benchmarking Spatial Join Operations with Spatial Output". In Proceedings of the 21st VLDB Conf., Zurich, Switzerland, September 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. KHT89 M. Kitsuregawa, L. Harada, and M. Takagi. ~'Join Strategies on KD-Tree Indexed Relations". In IEEE Transactions on Knowledge and Data Engineering, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. LR94 M. L. Lo and C. V. Ravishankar. "Spatial Joins Using Seeded Trees". In Proceedings of the 1994 A CM-SIGMOD Conference, Minneapolis, May 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. LR95 M.L. Lo and C. V. Ravishankar. "Generating Seeded Trees From Data Sets". In Proceedings of the Fourth International Symposium on Large Spatial Databases, Portland, ME, August 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. LR96 M.L. Lo and C. V. Ravishankar. "Spatial Hash-Joins". In Proceedings of the 1996 A CM-SIGMOD Conference, Montreal, Canada, June 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. MC80 C. Mead and L. Conway. "bTtroduction to VLSI Systems". Addison-Wesley, Reading, Mass., 1980. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. MGR91 D.J. Maguire, M. E Goodchild, and D. W. Rhind. "Geographic h~formation Systems" volume 1. Longman Scientific & Technical, copublishect in the US with John Wiley & Sons, Inc. New York, 1991.Google ScholarGoogle Scholar
  26. NHS84 J. Nievergelt, H. Hinterberger, and K. C. Sevcik. "The Grid File: An Adaptable, Symmetric Multikey File Structure". A CM Tra~zsactions on Database Systems, March 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. NS86 R. C. Nelson and H. Samet. "A Consistent Hierarchical Representation for Vector Data". In Computer Graphics, volume 20(4), August 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. OM84 J. A. Orenstein and T. H. Merrett. "A Class of Data Structures for Associative Searching". In Proceedings of the 3rd A CM SIGA CT-SIGMOD Symposium on Principles of Database Systems, 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. OM88 J. A. Orenstein and F. A. Manola. "PROBE Spatial Data Modeling and Query Processing in an Image Database Application". In IEEE Transactions on Software Engineering, volume 14(5), May I988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Ore86 J, A. Orenstein. "Spatial Query Processing in an Object- Oriented Database System". In Proceedings of the I986 ACM- SIGMOD Conference, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Ore89 J. A. Orenstein. "'Redundancy in Spatml Databases". In Proceedings of the 1989 A CM-SIGMOD Conference, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Ore90 J. A. Orenstein. "A Comparison of Spatial Query Processing Techniques for Native and Parameter Spaces". In Proceedings of the 1990 ACM-SIGMOD Colference, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. PD J. M. Patel and D. J. DeWitt. "Partition Based Spatial-Merge Join". http://www.cs.wisc.edu/paradise/paradise.papers.html. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. PS88 F. R Preparata and M. I. Shamos, editors. "Computational Geometry" Springer, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Rot91 D. Rotem. "Spatial Join Indices". in IEEE Transactions on Knowledge and Data Engineering, Kobe, April 1991.Google ScholarGoogle Scholar
  36. SFGM93 M. Stonebraker, J. Frew, K. Gardels, and J. Meredith. "The SEQUOIA 2000 Storage Benchmark". In Proceedings of the 1993 ACM-SIGMOD Conference, Washington, D.C., May 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Sto86 M. Stonebraker. "The Case for Shared Nothing". Database Engineering, 9( 1 ), 1986.Google ScholarGoogle Scholar
  38. Tig U.S. Bureau of the Census, Washington, DC. "TIGER~Line Files(TM). 1992 Technical Documentation".Google ScholarGoogle Scholar
  39. TY95 K L. Tan and J. X. Yu. "A Performance Study of Declusterlng Strategies for Parallel Spatial Databases". In The 6th hzternational Conference on Database and Expert Systems Applications (DEXA), London, United Kingdom, September 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Ube94 M. Ubell. "The Montage Extensible DataBlade Architecture". In Proceedings of the 1994 A CM-SIGMOD Conference, May 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Val87 R Valduriez. "'Join Indices". InACMTODS, volume t2(2), 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. ZG90 H. Zeller and J. Gray. "An Adaptive Hash Join Algorithm for Multiuser Environments". In Proceedings of the 16th VLDB Co~, Brisbane, Australia, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Partition based spatial-merge join

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM SIGMOD Record
        ACM SIGMOD Record  Volume 25, Issue 2
        June 1996
        557 pages
        ISSN:0163-5808
        DOI:10.1145/235968
        Issue’s Table of Contents
        • cover image ACM Conferences
          SIGMOD '96: Proceedings of the 1996 ACM SIGMOD international conference on Management of data
          June 1996
          560 pages
          ISBN:0897917944
          DOI:10.1145/233269

        Copyright © 1996 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 June 1996

        Check for updates

        Qualifiers

        • article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader