skip to main content
10.1145/2463676.2463700acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

TOUCH: in-memory spatial join by hierarchical data-oriented partitioning

Authors Info & Claims
Published:22 June 2013Publication History

ABSTRACT

Efficient spatial joins are pivotal for many applications and particularly important for geographical information systems or for the simulation sciences where scientists work with spatial models. Past research has primarily focused on disk-based spatial joins; efficient in-memory approaches, however, are important for two reasons: a) main memory has grown so large that many datasets fit in it and b) the in-memory join is a very time-consuming part of all disk-based spatial joins.

In this paper we develop TOUCH, a novel in-memory spatial join algorithm that uses hierarchical data-oriented space partitioning, thereby keeping both its memory footprint and the number of comparisons low. Our results show that TOUCH outperforms known in-memory spatial-join algorithms as well as in-memory implementations of disk-based join approaches. In particular, it has a one order of magnitude advantage over the memory-demanding state of the art in terms of number of comparisons (i.e., pairwise object comparisons), as well as execution time, while it is two orders of magnitude faster when compared to approaches with a similar memory footprint. Furthermore, TOUCH is more scalable than competing approaches as data density grows.

References

  1. W. G. Aref and H. Samet. A Cost Model for Query Optimization Using R-Trees. In GIS '94.Google ScholarGoogle Scholar
  2. W. G. Aref and H. Samet. Cascaded Spatial Join Algorithms with Spatially Sorted Output. In GIS '96. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. W. G. Aref and H. Samet. Hashing by Proximity to Process Duplicates in Spatial Databases. In CIKM '94. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. L. Arge, M. de Berg, H. J. Haverkort, and K. Yi. The Priority R-tree: a practically efficient and worst-case optimal R-tree. In SIGMOD '04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. L. Arge, O. Procopiuc, S. Ramaswamy, T. Suel, and J. S. Vitter. Scalable Sweeping-Based Spatial Join. In VLDB '98. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger. The R*-Tree: an efficient and robust access method for points and rectangles. SIGMOD Record, 19(2):322--331, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. T. Brinkhoff, H.-P. Kriegel, and B. Seeger. Efficient Processing of Spatial Joins Using R-Trees. In SIGMOD '93. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J.-P. Dittrich and B. Seeger. Data Redundancy and Duplicate Detection in Spatial Join Processing. In ICDE 2000.Google ScholarGoogle Scholar
  9. R. Elmasri and S. B. Navathe. Fundamentals of Database Systems. Addison Wesley, 3rd edition, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Farris, A. Sharma, C. Niedermayr, D. Brat, D. Foran, F. Wang, J. Saltz, J. Kong, L. Cooper, T. Oh, T. Kurc, T. Pan, and W. Chen. A Data Model and Database for High-resolution Pathology Analytical Image Informatics. Journal of Pathology Informatics, 2(1):32, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  11. Y. J. García, M. A. López, and S. T. Leutenegger. A Greedy Algorithm for Bulk Loading R-trees. In GIS '96.Google ScholarGoogle Scholar
  12. S. Gnanakaran, H. Nymeyer, J. Portman, K. Y. Sanbonmatsu, and A. E. Garcia. Peptide folding simulations. Current Opinion in Structural Biology, 13(2):168--174, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  13. A. Guttman. R-trees: a Dynamic Index Structure for Spatial Searching. In SIGMOD '84. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. O. P. Hamill, A. Marty, E. Neher, B. Sakmann, and F. J. Sigworth. Improved Patch-clamp Techniques for High-resolution Current Recording from Cells and Cell-free Membrane Patches. Pflügers Archiv European Journal of Physiology, 391:85--100, 1981.Google ScholarGoogle ScholarCross RefCross Ref
  15. E. H. Jacox and H. Samet. Spatial Join Techniques. ACM TODS '07. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. I. Kamel and C. Faloutsos. Hilbert R-tree: An Improved R-tree using Fractals. In VLDB '94. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. N. Koudas and K. C. Sevcik. Size Separation Spatial Join. In SIGMOD '97. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Kozloski, K. Sfyrakis, S. Hill, F. Schürmann, C. Peck, and H. Markram. Identifying, Tabulating, and Analyzing Contacts Between Branched Neuron Morphologies. IBM Journal of Research and Development, 52(1/2):43--55, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. S. Leutenegger, M. Lopez, and J. Edgington. STR: a Simple and Efficient Algorithm for R-Tree Packing. In ICDE '97. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. M.-L. Lo and C. V. Ravishankar. Spatial Hash-Joins. In SIGMOD '96. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M.-L. Lo and C. V. Ravishankar. Spatial Joins Using Seeded Trees. In SIGMOD '94. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. G. Luo, J. F. Naughton, and C. J. Ellmann. A non-blocking parallel spatial join algorithm. In ICDE, pages 697--705, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. N. Mamoulis and D. Papadias. Slot Index Spatial Join. IEEE TKDE, 15(1):211--231, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. P. Mishra and M. H. Eich. Join Processing in Relational Databases. ACM Computing Surveys, 24(1):63--113, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Nobari, T.-T. Cao, P. Karras, and S. Bressan. Scalable parallel minimum spanning forest computation. In Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming, PPoPP '12, pages 205--214, New York, NY, USA, 2012. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. J. Orenstein. A Comparison of Spatial Query Processing Techniques for Native and Parameter Spaces. In SIGMOD '90. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J. M. Patel and D. J. DeWitt. Partition Based Spatial-Merge Join. In SIGMOD '96. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. F. Preparata and M. Shamos. Computational Geometry: An Introduction. Springer, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. T. K. Sellis, N. Roussopoulos, and C. Faloutsos. The R++-Tree: A Dynamic Index for Multi-Dimensional Objects. In VLDB '87. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Ubell. The Montage Extensible DataBlade Architecture. In SIGMOD '94. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. TOUCH: in-memory spatial join by hierarchical data-oriented partitioning

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
          June 2013
          1322 pages
          ISBN:9781450320375
          DOI:10.1145/2463676

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 22 June 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          SIGMOD '13 Paper Acceptance Rate76of372submissions,20%Overall Acceptance Rate785of4,003submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader