skip to main content
research-article

The HV-tree: a memory hierarchy aware version index

Published:01 September 2010Publication History
Skip Abstract Section

Abstract

The huge amount of temporal data generated from many important applications call for a highly efficient and scalable version index. The TSB-tree has the potential of large scalability due to its unique feature of progressive migration of data to larger mediums. However, its traditional design optimized for two levels of the memory hierarchy (the main memory and the hard disk) undermines its potential for high efficiency in face of today's advances in hardware, especially CPU/cache speed and memory size. We propose a novel version index structure called the HV-tree. Different from all previous version index structures, the HV-tree has nodes of different sizes, each optimized for a level of the memory hierarchy. As data migrates to different levels of the memory hierarchy, the HV-tree will adjust the node size automatically to exploit the best performance of all levels of the memory hierarchy. Moreover, the HV-tree has a unique chain mechanism to maximally keep recent data in higher levels of the memory hierarchy. As a result, HV-tree is several times faster than the TSB-tree for point queries (query with single key and single time value), and up to 1000 times faster than the TSB-tree for key-range and time-range queries.

References

  1. http://www.sdss.org/.Google ScholarGoogle Scholar
  2. C.-H. Ang and K.-P. Tan. The interval b-tree. Inf. Process. Lett., 53(2):85--89, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Balasubramonian, D. Albonesi, A. Buyuktosunoglu, and S. Dwarkadas. Memory hierarchy reconfiguration for energy and performance in general-purpose processor architectures. In MICRO, pages 245--257, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. B. Becker, S. Gschwind, T. Ohler, B. Seeger, and P. Widmayer. An asymptotically optimal multiversion b-tree. VLDB Journal, 5(4):264--275, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. A. Bernstein, M. L. Brodie, S. Ceri, D. J. DeWitt, M. J. Franklin, H. Garcia-Molina, J. Gray, G. Held, J. M. Hellerstein, H. V. Jagadish, M. Lesk, D. Maier, J. F. Naughton, H. Pirahesh, M. Stonebraker, and J. D. Ullman. The asilomar report on database research. SIGMOD Record, 27(4):74--80, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. C. Easton. Key-sequence data sets on inedible storage. IBM Journal of Research and Development, 30(3):230--241, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Elmasri, G. T. J. Wuu, and V. Kouramajian. The time index and the monotonic b+-tree. In Temporal Databases, pages 433--456. 1993.Google ScholarGoogle Scholar
  8. A. Guttman. R-trees: a dynamic index structure for spatial searching. In SIGMOD, pages 47--57, 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. R. A. Hankins and J. M. Patel. Effect of node size on the performance of cache-conscious b+-trees. In SIGMETRICS, pages 283--294. ACM, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P.-Å. Larson. Performance analysis of linear hashing with partial expansions. TODS, 7(4):566--587, 1982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. B. Lomet. Partial expansions for file organizations with an index. TODS, 12(1):65--84, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. B. Lomet, M. Hong, R. V. Nehme, and R. Zhang. Transaction time indexing with version compression. PVLDB, 1(1), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. D. B. Lomet and B. Salzberg. Access methods for multiversion data. In SIGMOD Conference, pages 315--324, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. B. Lomet and B. Salzberg. The performance of a multiversion access method. In SIGMOD Conference, pages 353--363, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Rao and K. A. Ross. Cache conscious indexing for decision-support in main memory. In VLDB, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Rao and K. A. Ross. Making b+ -trees cache conscious in main memory. In SIGMOD Conference, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. A. Shatdal, C. Kant, and J. F. Naughton. Cache conscious algorithms for relational query processing. In VLDB, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Y. Tao and D. Papadias. Mv3r-tree: A spatio-temporal access method for timestamp and interval queries. In VLDB, pages 431--440, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. P. Westerman. Data warehousing: using the Wal-Mart model. Morgan Kaufmann, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. M. E. Wolf and M. S. Lam. A data locality optimizing algorithm. SIGPLAN Not., 26(6):30--44, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. The HV-tree: a memory hierarchy aware version index
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image Proceedings of the VLDB Endowment
          Proceedings of the VLDB Endowment  Volume 3, Issue 1-2
          September 2010
          1658 pages

          Publisher

          VLDB Endowment

          Publication History

          • Published: 1 September 2010
          Published in pvldb Volume 3, Issue 1-2

          Qualifiers

          • research-article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader