DOI: 10.1145/3400903.3401689
demonstration

DocDesign: Cost-Based Database Design for Document Stores

Published: 30 July 2020

ABSTRACT

Document stores have become one of the most popular kinds of NoSQL systems, mainly due to their semi-structured data storage and well-developed query capabilities. Their semi-structured nature permits database designs that go beyond traditional normalization theory, which complicates design decisions by opening up a myriad of possibilities. As a result, database design for document stores has resorted to ad hoc trial-and-error methods. However, a good database design is essential for any data storage system’s performance, and bad design decisions cannot always be compensated for by adding more powerful hardware. In this work, we therefore propose DocDesign, a decision aid tool for document store database design. DocDesign allows its users to evaluate different database designs against given data storage requirements and a particular workload. Through DocDesign, users can make informed design decisions by evaluating the estimated storage statistics and query runtimes of a design without testing it on an actual document store. DocDesign also generates design-specific queries for the input workload. This not only cuts down the time and effort spent on design decision making and development but also saves the money spent on fixing poor designs in the long run. On-site, we will showcase how DocDesign facilitates the design decision-making process for MongoDB with both synthetic and real-world examples.


  • Published in

    SSDBM '20: Proceedings of the 32nd International Conference on Scientific and Statistical Database Management
    July 2020
    241 pages
    ISBN: 9781450388146
    DOI: 10.1145/3400903

    Copyright © 2020 Owner/Author

    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Qualifiers

    • demonstration
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate: 56 of 146 submissions, 38%
