skip to main content
research-article

A data management method for databases using hybrid storage systems

Published:08 April 2019Publication History
Skip Abstract Section

Abstract

When applications require high I/O performance, solid-state drives (SSDs) are often preferable because they perform better than traditional hard-disk drives (HDDs). Therefore, database system response time can be improved by moving frequently used data to SSDs. However, because capacity is both more limited and more expensive for SSDs than it is for HDDs, there is a need to determine which data are most appropriate to migrate from HDDs to SSDs. In this paper, we propose a data management method for databases using SSDs by considering an integrated mechanism and a migration rule for performing migrations between HDDs and SSDs. The proposed method can provide database systems with high I/O performance by properly moving high-priority data to SSDs with fast access capabilities while maintaining the low-priority data on HDDs, which have lower costs. The results of experiments show that the proposed method can achieve the goal while requiring only small amounts of SSD capacity.

References

  1. B-cache Repository. http://bcache.evilpiepirate.org/.Google ScholarGoogle Scholar
  2. FlashCache Repository. https://github.com/facebook/flashcache.Google ScholarGoogle Scholar
  3. Hammerdb is an open source database load testing and benchmarking tool.Google ScholarGoogle Scholar
  4. Tpc-c is an on-line transaction processing benchmark.Google ScholarGoogle Scholar
  5. Tpc-h is a decision support benchmark.Google ScholarGoogle Scholar
  6. Performance value of solid state drives using IBM i Repository. http://www-03.ibm.com/systems/resources/, May 2009.Google ScholarGoogle Scholar
  7. B. H. Bloom. Space/time trade-offs in hash coding with allowable errors. volume 13(7), pages 422--426, July 1970. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Canim, G. A. Mihaila, B. Bhattacharjee, K. A. Ross, and C. A. Lang. An object placement advisor for db2 using solid state storage. In Proceedings of the VLDB Endowment, volume 2, pages 1318--1329, Aug 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Canim, G. A. Mihaila, B. Bhattacharjee, K. A. Ross, and C. A. Lang. Ssd bufferpool extensions for database systems. In Proceedings of the VLDB Endowment, volume 3, pages 1435--1446, September 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Choi, B. Lee, D. Jung, and H. Y. Youn. An ssd-based accelerator using partitioned bloom filter for directory parsing. In IEEE International Conference on IT Convergence and Security, pages 1--5, August 2015.Google ScholarGoogle ScholarCross RefCross Ref
  11. J. Goldstein and P. ÃĚke Larson. Optimizing queries using materialized views: a practical, scalable solution. In ACM SIGMOD international conference on Management of data, pages 331--342, June 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. W. Hsieh, L. P. Chang, and T. W. Kuo. Efficient identification of hot data for flash memory storage systems. ACM Transactions on Storage, 2(1), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. C.-K. Kang, Y.-J. Cai, C.-H. Wu, , and P.-C. Hsiu. A hybrid storage access framework for high-performance virtual machines. ACM Transactions on Embedded Computing Systems, 13(5s), 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Katsifodimos, I. Manolescu, and V. Vassalos. Materialized view selection for xquery workloads. In ACM SIGMOD International Conference on Management of Data, pages 565--576, May 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. L. Lin, Y. Zhu, J. Yue, Z. Cai, and B. Segee. Hot random off-loading: A hybrid storage system with dynamic data migration. In 19th Annual IEEE International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems, pages 318--325, July 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Mehta and D. J. DeWitt. Data placement in shared-nothing parallel database systems. volume 6, pages 53--72, February 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Oracle. Take the guesswork out of database layout and i/o tuning with automatic storage management. In Oracle Technical White Paper, December 2005.Google ScholarGoogle Scholar
  18. J. Ou, J. Shu, Y. Lu, L. Yi, , and W. Wang. Edm: an endurance-aware data migration scheme for load balancing in ssd storage clusters. In IEEE International Parallel and Distributed Processing Symposium, pages 787--796, May 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. D. Park. Hot and cold data identification: Applications to storage devices and systems. In Ph.D. dissertation, UNIVERSITY OF MINNESOTA, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. Park and D. H. Du. Hot data identification for flash-based storage systems using multiple bloom filters. In Mass Storage Systems and Technologies (MSST), pages 1--11, May 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. A. Sachedina, M. Huras, and A. Colangelo. Best practices database storage. In White paper, IBM DB2 for Linux, UNIX, and Windows, Oct 2008.Google ScholarGoogle Scholar
  22. J. Schindler, A. Ailamaki, and G. R. Ganger. Matching database access patterns to storage characteristics. In In FAST '02:Proceedings of the 1st USENIX Conference on Fileand Storage Technologies, page 22, 2002.Google ScholarGoogle Scholar
  23. H. Shi, R. V. Arumugam, C. H. Foh, and K. K. Khaing. Optimal disk storage allocation for multitier storage system. IEEE Transactions on magnetics, 49(6), June 2013.Google ScholarGoogle Scholar
  24. C.-H. Wu, P.-H. Wu, K.-L. Chen, W.-Y. Chang, and K.-C. Lai. A hotness filter of files for reliable non-volatile memory systems. IEEE Transactions on Dependable and Secure Computing, 12(4), July 2015.Google ScholarGoogle ScholarCross RefCross Ref
  25. C. Xu, W. Wang, D. Zhou, and T. Xie. An ssd-hdd integrated storage architecture for write-once-read-once applications on clusters. In IEEE International Conference on Cluster Computing, pages 74--77, September 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. C. Zhu, Q. Zhu, C. Zuzarte, and W. Ma. A materialized-view based technique to optimize progressive queries via dependency analysis. In Proceedings of the Conference of the Center for Advanced Studies on Collaborative Research, pages 60--73, November 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A data management method for databases using hybrid storage systems

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader