DOI: 10.1145/2047196.2047247
research-article

ReVision: automated classification, analysis and redesign of chart images

Published: 16 October 2011

ABSTRACT

Poorly designed charts are prevalent in reports, magazines, books and on the Web. Most of these charts are only available as bitmap images; without access to the underlying data it is prohibitively difficult for viewers to create more effective visual representations. In response we present ReVision, a system that automatically redesigns visualizations to improve graphical perception. Given a bitmap image of a chart as input, ReVision applies computer vision and machine learning techniques to identify the chart type (e.g., pie chart, bar chart, or scatterplot). It then extracts the graphical marks and infers the underlying data. Using a corpus of images drawn from the web, ReVision achieves image classification accuracy of 96% across ten chart categories. It also accurately extracts marks from 79% of bar charts and 62% of pie charts, and from these charts it successfully extracts data from 71% of bar charts and 64% of pie charts. ReVision then applies perceptually based design principles to populate an interactive gallery of redesigned charts. With this interface, users can view alternative chart designs and retarget content to different visual styles.
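To make the "infers the underlying data" step concrete, here is a minimal sketch of how extracted marks could be turned back into numbers. It is an illustration under stated assumptions, not ReVision's actual method: it assumes the marks are already extracted, that the bar chart's y-axis is linear with two labeled ticks at known pixel rows, and that pie slices are given as angles. The function names `bar_values_from_pixels` and `pie_shares_from_angles` are hypothetical.

```python
from typing import List, Tuple


def bar_values_from_pixels(
    bar_top_rows: List[float],
    tick_a: Tuple[float, float],
    tick_b: Tuple[float, float],
) -> List[float]:
    """Map the pixel row of each bar top to a data value.

    Each tick is (pixel_row, data_value). Assumption: the y-axis is
    linear, so two labeled ticks are enough to fix the mapping.
    """
    (row_a, val_a), (row_b, val_b) = tick_a, tick_b
    if row_a == row_b:
        raise ValueError("the two ticks must lie on different pixel rows")
    # Data units per pixel; negative when rows grow downward in the image.
    scale = (val_b - val_a) / (row_b - row_a)
    return [val_a + (row - row_a) * scale for row in bar_top_rows]


def pie_shares_from_angles(slice_angles_deg: List[float]) -> List[float]:
    """Convert pie slice angles (in degrees) to percentage shares."""
    total = sum(slice_angles_deg)
    if total <= 0:
        raise ValueError("slice angles must sum to a positive value")
    # Normalize by the measured total so small extraction errors
    # do not leave the shares summing to something other than 100.
    return [100.0 * angle / total for angle in slice_angles_deg]


# Hypothetical measurements: axis label "0" at pixel row 400, "100" at row 200.
bars = bar_values_from_pixels([300, 200, 250], (400, 0.0), (200, 100.0))
shares = pie_shares_from_angles([90, 180, 90])
```

Normalizing pie angles by their measured sum rather than by 360 is a design choice in this sketch: it keeps the recovered percentages consistent even when the extracted slices do not close perfectly.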

    Published in

      UIST '11: Proceedings of the 24th annual ACM symposium on User interface software and technology
      October 2011
      654 pages
      ISBN:9781450307161
      DOI:10.1145/2047196

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Acceptance Rates

      UIST '11 Paper Acceptance Rate: 67 of 262 submissions, 26%
      Overall Acceptance Rate: 842 of 3,967 submissions, 21%
