research-article

ASTERIX: an open source system for "Big Data" management and analysis (demo)

Published: 01 August 2012

Abstract

At UC Irvine, we are building a next-generation parallel database system, called ASTERIX, as our approach to addressing today's "Big Data" management challenges. ASTERIX aims to combine time-tested principles from parallel database systems with those of the Web-scale computing community, such as fault tolerance for long-running jobs. In this demo, we present a whirlwind tour of ASTERIX, highlighting a few of its key features. We will demonstrate examples of our data definition language for modeling semi-structured data, and examples of interesting queries written in our declarative query language. In particular, we will show the capabilities of ASTERIX for answering geo-spatial queries and fuzzy queries, as well as ASTERIX's data feed construct for continuously ingesting data.

    • Published in

      Proceedings of the VLDB Endowment, Volume 5, Issue 12
      August 2012
      340 pages

      Publisher

      VLDB Endowment
