Skip to main content
Top

2016 | OriginalPaper | Chapter

FigureSeer: Parsing Result-Figures in Research Papers

Authors : Noah Siegel, Zachary Horvitz, Roie Levin, Santosh Divvala, Ali Farhadi

Published in: Computer Vision – ECCV 2016

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

‘Which are the pedestrian detectors that yield a precision above 95 % at 25 % recall?’ Answering such a complex query involves identifying and analyzing the results reported in figures within several research papers. Despite the availability of excellent academic search engines, retrieving such information poses a cumbersome challenge today as these systems have primarily focused on understanding the text content of scholarly documents. In this paper, we introduce FigureSeer, an end-to-end framework for parsing result-figures, that enables powerful search and retrieval of results in research papers. Our proposed approach automatically localizes figures from research papers, classifies them, and analyses the content of the result-figures. The key challenge in analyzing the figure content is the extraction of the plotted data and its association with the legend entries. We address this challenge by formulating a novel graph-based reasoning approach using a CNN-based similarity metric. We present a thorough evaluation on a real-word annotated dataset to demonstrate the efficacy of our approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
1.
go back to reference Khabsa, M., Giles, C.L.: The number of scholarly documents on the public web. PLoS ONE 9(5), e93949 (2014)CrossRef Khabsa, M., Giles, C.L.: The number of scholarly documents on the public web. PLoS ONE 9(5), e93949 (2014)CrossRef
3.
go back to reference Tufte, E.R.: Visual display of quantitative information. In: Graphics Press, Cheshire (1983) Tufte, E.R.: Visual display of quantitative information. In: Graphics Press, Cheshire (1983)
4.
go back to reference Grice, P.: Logic and conversation. In: Speech Acts (1975) Grice, P.: Logic and conversation. In: Speech Acts (1975)
5.
go back to reference Heer, J., et al.: Crowdsourcing graphical perception: using mechanical turk to assess visualization design. In: CHI (2010) Heer, J., et al.: Crowdsourcing graphical perception: using mechanical turk to assess visualization design. In: CHI (2010)
6.
go back to reference Savva, M., et al.: ReVision: automated classification, analysis and redesign of chart images. In: UIST (2011) Savva, M., et al.: ReVision: automated classification, analysis and redesign of chart images. In: UIST (2011)
10.
go back to reference Choudhury, S.R., et al.: Automatic extraction of figures from scholarly documents. In: DocEng (2015) Choudhury, S.R., et al.: Automatic extraction of figures from scholarly documents. In: DocEng (2015)
11.
go back to reference Clark, C., Divvala, S.: Looking beyond text: extracting figures, tables, and captions from computer science paper. In: AAAI Workshop (2015) Clark, C., Divvala, S.: Looking beyond text: extracting figures, tables, and captions from computer science paper. In: AAAI Workshop (2015)
12.
go back to reference Kuhn, T., et al.: Finding and accessing diagrams in biomedical publications. In: AMIA (2012) Kuhn, T., et al.: Finding and accessing diagrams in biomedical publications. In: AMIA (2012)
13.
go back to reference Choudhury, S.R., Giles, C.L.: An architecture for information extraction from figures in digital libraries. In: WWW (Companion Volume) (2015) Choudhury, S.R., Giles, C.L.: An architecture for information extraction from figures in digital libraries. In: WWW (Companion Volume) (2015)
14.
go back to reference Chan, J., et al.: Searching off-line arabic documents. In: CVPR (2006) Chan, J., et al.: Searching off-line arabic documents. In: CVPR (2006)
15.
go back to reference Liu, Y., et al.: Tableseer: automatic table metadata extraction and searching in digital libraries. In: JCDL (2007) Liu, Y., et al.: Tableseer: automatic table metadata extraction and searching in digital libraries. In: JCDL (2007)
16.
go back to reference Kae, A., et al.: Improving state-of-the-art OCR through high-precision document-specific modeling. In: CVPR (2010) Kae, A., et al.: Improving state-of-the-art OCR through high-precision document-specific modeling. In: CVPR (2010)
17.
go back to reference Wu, J., et al.: CiteseerX: AI in a digital library search engine. In: AAAI (2014) Wu, J., et al.: CiteseerX: AI in a digital library search engine. In: AAAI (2014)
20.
go back to reference Wu, P., Carberry, S., Elzer, S., Chester, D.: Recognizing the intended message of line graphs. In: Goel, A.K., Jamnik, M., Narayanan, N.H. (eds.) Diagrams 2010. LNCS (LNAI), vol. 6170, pp. 220–234. Springer, Heidelberg (2010). doi:10.1007/978-3-642-14600-8_21 CrossRef Wu, P., Carberry, S., Elzer, S., Chester, D.: Recognizing the intended message of line graphs. In: Goel, A.K., Jamnik, M., Narayanan, N.H. (eds.) Diagrams 2010. LNCS (LNAI), vol. 6170, pp. 220–234. Springer, Heidelberg (2010). doi:10.​1007/​978-3-642-14600-8_​21 CrossRef
21.
go back to reference Xu, S., McCusker, J., Krauthammer, M.: Yale image finder (YIF): a new search engine for retrieving biomedical images. Bioinformatics 24(17), 1968–1970 (2008)CrossRef Xu, S., McCusker, J., Krauthammer, M.: Yale image finder (YIF): a new search engine for retrieving biomedical images. Bioinformatics 24(17), 1968–1970 (2008)CrossRef
22.
go back to reference Choudhury, S., et al.: A figure search engine architecture for a chemistry digital library. In: JCDL (2013) Choudhury, S., et al.: A figure search engine architecture for a chemistry digital library. In: JCDL (2013)
23.
go back to reference Li, Z., et al.: Towards retrieving relevant information graphics. In: SIGIR (2013) Li, Z., et al.: Towards retrieving relevant information graphics. In: SIGIR (2013)
24.
go back to reference Krizhevsky, A., et al.: Imagenet classification with deep convolutional neural networks. In: NIPS (2012) Krizhevsky, A., et al.: Imagenet classification with deep convolutional neural networks. In: NIPS (2012)
25.
go back to reference He, K., et al.: Deep residual learning for image recognition. In: CVPR (2016) He, K., et al.: Deep residual learning for image recognition. In: CVPR (2016)
26.
go back to reference Deng, J., et al.: ImageNet: a large-scale hierarchical image database. In: CVPR (2009) Deng, J., et al.: ImageNet: a large-scale hierarchical image database. In: CVPR (2009)
27.
go back to reference McCullagh, P., Nelder, J.: Generalized linear models. In: Chapman and Hall, London (1989) McCullagh, P., Nelder, J.: Generalized linear models. In: Chapman and Hall, London (1989)
28.
go back to reference Breiman, L.: Random forests. In: Machine Learning (2001) Breiman, L.: Random forests. In: Machine Learning (2001)
29.
go back to reference Felzenszwalb, P., Veksler, O.: Tiered scene labeling with dynamic programming. In: CVPR (2010) Felzenszwalb, P., Veksler, O.: Tiered scene labeling with dynamic programming. In: CVPR (2010)
30.
go back to reference Joachims, T.: Training linear svms in linear time. In: KDD (2006) Joachims, T.: Training linear svms in linear time. In: KDD (2006)
31.
go back to reference Felzenszwalb, P., et al.: Discriminatively trained, multiscale, deformable part model. In: CVPR (2008) Felzenszwalb, P., et al.: Discriminatively trained, multiscale, deformable part model. In: CVPR (2008)
32.
go back to reference Zagoruyko, S., Komodakis, N.: Learning to compare image patches via cnns. In: CVPR (2015) Zagoruyko, S., Komodakis, N.: Learning to compare image patches via cnns. In: CVPR (2015)
33.
go back to reference Han, X., et al.: MatchNet: unifying feature and metric learning for patch-based matching. In: CVPR (2015) Han, X., et al.: MatchNet: unifying feature and metric learning for patch-based matching. In: CVPR (2015)
34.
go back to reference Hadsell, R., et al.: Dimensionality reduction by learning an invariant mapping. In: CVPR (2006) Hadsell, R., et al.: Dimensionality reduction by learning an invariant mapping. In: CVPR (2006)
35.
go back to reference Shechtman, E., Irani, M.: Matching local self-similarities across images and videos. In: CVPR (2007) Shechtman, E., Irani, M.: Matching local self-similarities across images and videos. In: CVPR (2007)
36.
go back to reference Dillencourt, M.B., Samet, H., Tamminen, M.: A general approach to connected-component labeling for arbitrary image representations. J. ACM (JACM) 39(2), 253–280 (1992)MathSciNetCrossRefMATH Dillencourt, M.B., Samet, H., Tamminen, M.: A general approach to connected-component labeling for arbitrary image representations. J. ACM (JACM) 39(2), 253–280 (1992)MathSciNetCrossRefMATH
39.
go back to reference Sorokin, A., Forsyth, D.: Utility data annotation with amazon mechanical turk. In: CVPR Workshop (2008) Sorokin, A., Forsyth, D.: Utility data annotation with amazon mechanical turk. In: CVPR Workshop (2008)
40.
go back to reference Everingham, M., et al.: The PASCAL visual object classes (VOC) challenge - a retrospective. In: IJCV (2015) Everingham, M., et al.: The PASCAL visual object classes (VOC) challenge - a retrospective. In: IJCV (2015)
44.
go back to reference Hou, X., Yuille, A., Koch, C.: Boundary detection benchmarking: beyond F-measures. In: CVPR (2013) Hou, X., Yuille, A., Koch, C.: Boundary detection benchmarking: beyond F-measures. In: CVPR (2013)
45.
go back to reference Corio, M., et al.: Generation of texts for information graphics. In: EWNLG (1999) Corio, M., et al.: Generation of texts for information graphics. In: EWNLG (1999)
46.
go back to reference Carberry, S., et al.: Extending document summarization to information graphics. In: ACL Workshop (2004) Carberry, S., et al.: Extending document summarization to information graphics. In: ACL Workshop (2004)
47.
go back to reference Kulkarni, G., et al.: Baby talk: understanding and generating simple image descriptions. In: CVPR (2011) Kulkarni, G., et al.: Baby talk: understanding and generating simple image descriptions. In: CVPR (2011)
48.
go back to reference Moraes, P., et al.: Generating summaries of line graphs. In: INLG (2014) Moraes, P., et al.: Generating summaries of line graphs. In: INLG (2014)
49.
go back to reference Chen, X., Zitnick, C.: A recurrent visual representation for image caption generation. In: CVPR (2015) Chen, X., Zitnick, C.: A recurrent visual representation for image caption generation. In: CVPR (2015)
50.
go back to reference Ladner, R.: My path to becoming an accessibility researcher. In: SIGACCESS (2014) Ladner, R.: My path to becoming an accessibility researcher. In: SIGACCESS (2014)
51.
go back to reference Russell, B.C., et al.: 3D Wikipedia: using online text to automatically label and navigate reconstructed geometry. In: Siggraph Asia (2013) Russell, B.C., et al.: 3D Wikipedia: using online text to automatically label and navigate reconstructed geometry. In: Siggraph Asia (2013)
52.
go back to reference Seo, M.J., et al.: Diagram understanding in geometry questions. In: AAAI (2014) Seo, M.J., et al.: Diagram understanding in geometry questions. In: AAAI (2014)
53.
55.
go back to reference Williams, K., et al.: Simseerx: a similar document search engine. In: DocEng (2014) Williams, K., et al.: Simseerx: a similar document search engine. In: DocEng (2014)
56.
go back to reference Noorden, V.: Publishers withdraw more than 120 gibberish papers. In: Nature (2014) Noorden, V.: Publishers withdraw more than 120 gibberish papers. In: Nature (2014)
57.
go back to reference Sironi, A., et al.: Multiscale centerline detection by learning a scale-space distance transform. In: CVPR (2014) Sironi, A., et al.: Multiscale centerline detection by learning a scale-space distance transform. In: CVPR (2014)
Metadata
Title
FigureSeer: Parsing Result-Figures in Research Papers
Authors
Noah Siegel
Zachary Horvitz
Roie Levin
Santosh Divvala
Ali Farhadi
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-46478-7_41

Premium Partner