skip to main content
research-article

OceanST: a distributed analytic system for large-scale spatiotemporal mobile broadband data

Published:01 August 2014Publication History
Skip Abstract Section

Abstract

With the increasing prevalence of versatile mobile devices and the fast deployment of broadband mobile networks, a huge volume of Mobile Broadband (MBB) data has been generated over time. The MBB data naturally contain rich information of a large number of mobile users, covering a considerable fraction of whole population nowadays, including the mobile applications they are using at different locations and time; the MBB data may present the unprecedentedly large knowledge base of human behavior which has highly recognized commercial and social value. However, the storage, management and analysis of the huge and fast growing volume of MBB data post new and significant challenges to the industrial practitioners and research community. In this demonstration, we present a new, MBB data tailored, distributed analytic system named OceanST which has addressed a series of problems and weaknesses of the existing systems, originally designed for more general purpose and capable to handle MBB data to some extent. OceanST is featured by (i) efficiently loading of ever-growing MBB data, (ii) a bunch of spatiotemporal aggregate queries and basic analysis APIs frequently found in various MBB data application scenarios, and (iii) sampling-based approximate solution with provable accuracy bound to cope with huge volume of MBB data. The demonstration will show the advantage of OceanST in a cluster of 5 machines using 3TB data.

References

  1. Mongodb. www.mongodb.org.Google ScholarGoogle Scholar
  2. Sin chew daily. http://news.sinchew.com.my/node/318251.Google ScholarGoogle Scholar
  3. A. Aji, F. Wang, H. Vo, R. Lee, Q. Liu, X. Zhang, and J. Saltz. Hadoop gis: A high performance spatial data warehousing system over mapreduce. In Proceedings of VLDB Endowment, pages 1009--1020, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Eldawy and M. F. Mokbel. A demonstration of spatialhadoop: An efficient mapreduce framework for spatial data. In Proceedings of VLDB Endowment, pages 1230--1233, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y. Li, M. Steiner, J. Bao, L. Wang, and T. Zhu. Region sampling and estimation of geosocial data with dynamic range calibration. In Proceedings of ICDE, pages 1--12. IEEE, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  6. H. Tan, W. Luo, and L. M. Ni. Clost: a hadoop-based storage system for big spatio-temporal data analytics. In Proceedings of CIKM, pages 2139--2143. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica. Spark: cluster computing with working sets. In Proceedings of the 2nd USENIX conference on Hot topics in cloud computing, pages 10--10, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. OceanST: a distributed analytic system for large-scale spatiotemporal mobile broadband data
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image Proceedings of the VLDB Endowment
      Proceedings of the VLDB Endowment  Volume 7, Issue 13
      August 2014
      466 pages
      ISSN:2150-8097
      Issue’s Table of Contents

      Publisher

      VLDB Endowment

      Publication History

      • Published: 1 August 2014
      Published in pvldb Volume 7, Issue 13

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader