skip to main content
research-article

SeeDB: visualizing database queries efficiently

Published:01 December 2013Publication History
Skip Abstract Section

Abstract

Data scientists rely on visualizations to interpret the data returned by queries, but finding the right visualization remains a manual task that is often laborious. We propose a DBMS that partially automates the task of finding the right visualizations for a query. In a nutshell, given an input query Q, the new DBMS optimizer will explore not only the space of physical plans for Q, but also the space of possible visualizations for the results of Q. The output will comprise a recommendation of potentially "interesting" or "useful" visualizations, where each visualization is coupled with a suitable query execution plan. We discuss the technical challenges in building this system and outline an agenda for future research.

References

  1. K. Chakrabarti et al. Approximate query processing using wavelets. In VLDB, pages 111--122, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. G. Cormode and S. Muthukrishnan. An improved data stream summary: the count-min sketch and its applications. J. Algorithms, 55(1): 58--75, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. P. B. Gibbons. Distinct sampling for highly-accurate answers to distinct values queries and event reports. In VLDB, pages 541--550, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. H. Gonzalez et al. Google fusion tables: web-centered data management and collaboration. In SIGMOD Conference, pages 1061--1066, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. M. Hellerstein, P. J. Haas, and H. J. Wang. Online aggregation. In J. Peckham, editor, SIGMOD 1997, pages 171--182. ACM Press, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. Jermaine et al. Scalable approximate query processing with the dbo engine. ACM Trans. Database Syst., 33(4), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Kandel et al. Wrangler: interactive visual specification of data transformation scripts. In CHI, pages 3363--3372, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Kandel et al. Profiler: integrated statistical analysis and visualization for data quality assessment. In AVI, pages 547--554, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Livny et al. Devise: Integrated querying and visualization of large datasets. In SIGMOD Conference, pages 301--312, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. D. Mackinlay et al. Show me: Automatic presentation for visual analysis. IEEE Trans. Vis. Comput. Graph., 13(6): 1137--1144, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Parameswaran, N. Polyzotis, and H. Garcia-Molina. SeeDB: Visualizing Database Queries Efficiently. Stanford Infolab, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Sarawagi. Explaining differences in multidimensional aggregates. In VLDB, pages 42--53, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. S. Sarawagi. User-adaptive exploration of multidimensional data. In VLDB, pages 307--316, 2000.Google ScholarGoogle Scholar
  14. G. Sathe and S. Sarawagi. Intelligent rollups in multidimensional olap data. In VLDB, pages 531--540, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. T. K. Sellis. Multiple-query optimization. ACM TODS, 13(1): 23--52, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. C. Stolte et al. Polaris: a system for query, analysis, and visualization of multidimensional databases. Commun. ACM, 51(11): 75--84, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. C. Wang and H.-W. Shen. Information theory in scientific visualization. Entropy, 13(1): 254--273, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  18. Wikipedia. Jensen shannon divergence --- wikipedia, the free encyclopedia, 2013. {Online; accessed 16-July-2013}.Google ScholarGoogle Scholar
  19. Wikipedia. Kullback leibler divergence --- wikipedia, the free encyclopedia, 2013. {Online; accessed 16-July-2013}.Google ScholarGoogle Scholar
  20. Wikipedia. Statistical distance --- wikipedia, the free encyclopedia, 2013. {Online; accessed 16-July-2013}.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

  • Published in

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 7, Issue 4
    December 2013
    112 pages

    Publisher

    VLDB Endowment

    Publication History

    • Published: 1 December 2013
    Published in pvldb Volume 7, Issue 4

    Qualifiers

    • research-article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader