Skip to main content
Top

2018 | OriginalPaper | Chapter

iExplore: Accelerating Exploratory Data Analysis by Predicting User Intention

Authors : Zhihui Yang, Jiyang Gong, Chaoying Liu, Yinan Jing, Zhenying He, Kai Zhang, X. Sean Wang

Published in: Database Systems for Advanced Applications

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Exploratory data analysis over large datasets has become an increasingly prevalent use case. However, users are easily overwhelmed by the data and might take a long time to find interesting facts. In this paper, we design a system called iExplore to assist users in doing this time-consuming data exploration task through predicting user intention. Moreover, we propose an intention model to help the iExplore system have a comprehensive understanding of user’s intention. Thus, the exploratory process can be accelerated by the intention-driven recommendation and prefetching mechanisms. Extensive experiments demonstrate that the intention-driven iExplore system can significantly lighten the burden of users and facilitate the exploratory process.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Abazajian, K.N., Adelman-McCarthy, J.K., Agüeros, M.A., Allam, S.S., Prieto, C.A., An, D., Anderson, K.S., Anderson, S.F., Annis, J., Bahcall, N.A., et al.: The seventh data release of the sloan digital sky survey. Astrophys. J. Suppl. Ser. 182(2), 543 (2009)CrossRef Abazajian, K.N., Adelman-McCarthy, J.K., Agüeros, M.A., Allam, S.S., Prieto, C.A., An, D., Anderson, K.S., Anderson, S.F., Annis, J., Bahcall, N.A., et al.: The seventh data release of the sloan digital sky survey. Astrophys. J. Suppl. Ser. 182(2), 543 (2009)CrossRef
2.
go back to reference Aouiche, K., Darmont, J.: Data mining-based materialized view and index selection in data warehouses. J. Intell. Inf. Syst. 33(1), 65–93 (2009)CrossRef Aouiche, K., Darmont, J.: Data mining-based materialized view and index selection in data warehouses. J. Intell. Inf. Syst. 33(1), 65–93 (2009)CrossRef
3.
go back to reference Bowman, I.T., Salem, K.: Semantic prefetching of correlated query sequences. In: 2007 IEEE 23rd International Conference on Data Engineering, ICDE 2007, pp. 1284–1288. IEEE (2007) Bowman, I.T., Salem, K.: Semantic prefetching of correlated query sequences. In: 2007 IEEE 23rd International Conference on Data Engineering, ICDE 2007, pp. 1284–1288. IEEE (2007)
5.
go back to reference Crane, M.: Diversified relevance feedback. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, p. 1142. ACM (2013) Crane, M.: Diversified relevance feedback. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, p. 1142. ACM (2013)
6.
go back to reference Dimitriadou, K., Papaemmanouil, O., Diao, Y.: AIDE: an active learning-based approach for interactive data exploration. IEEE Trans. Knowl. Data Eng. 28(11), 2842–2856 (2016)CrossRef Dimitriadou, K., Papaemmanouil, O., Diao, Y.: AIDE: an active learning-based approach for interactive data exploration. IEEE Trans. Knowl. Data Eng. 28(11), 2842–2856 (2016)CrossRef
7.
go back to reference Drosou, M., Pitoura, E.: Ymaldb: exploring relational databases via result-driven recommendations. VLDB J. 22(6), 849–874 (2013)CrossRef Drosou, M., Pitoura, E.: Ymaldb: exploring relational databases via result-driven recommendations. VLDB J. 22(6), 849–874 (2013)CrossRef
8.
go back to reference Eirinaki, M., Abraham, S., Polyzotis, N., Shaikh, N.: Querie: collaborative database exploration. IEEE Trans. Knowl. Data Eng. 26(7), 1778–1790 (2014)CrossRef Eirinaki, M., Abraham, S., Polyzotis, N., Shaikh, N.: Querie: collaborative database exploration. IEEE Trans. Knowl. Data Eng. 26(7), 1778–1790 (2014)CrossRef
9.
go back to reference Gagniuc, P.A.: Markov Chains: From Theory to Implementation and Experimentation. Wiley, Hoboken (2017)CrossRef Gagniuc, P.A.: Markov Chains: From Theory to Implementation and Experimentation. Wiley, Hoboken (2017)CrossRef
10.
go back to reference Idreos, S., Papaemmanouil, O., Chaudhuri, S.: Overview of data exploration techniques. In: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, pp. 277–281. ACM (2015) Idreos, S., Papaemmanouil, O., Chaudhuri, S.: Overview of data exploration techniques. In: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, pp. 277–281. ACM (2015)
11.
go back to reference Kamat, N., Jayachandran, P., Tunga, K., Nandi, A.: Distributed and interactive cube exploration. In: 2014 IEEE 30th International Conference on Data Engineering (ICDE), pp. 472–483. IEEE (2014) Kamat, N., Jayachandran, P., Tunga, K., Nandi, A.: Distributed and interactive cube exploration. In: 2014 IEEE 30th International Conference on Data Engineering (ICDE), pp. 472–483. IEEE (2014)
12.
go back to reference Khoussainova, N., Kwon, Y., Balazinska, M., Suciu, D.: SnipSuggest: context-aware autocompletion for SQL. Proc. VLDB Endow. 4(1), 22–33 (2010)CrossRef Khoussainova, N., Kwon, Y., Balazinska, M., Suciu, D.: SnipSuggest: context-aware autocompletion for SQL. Proc. VLDB Endow. 4(1), 22–33 (2010)CrossRef
14.
go back to reference Ramachandran, K., Shah, B., Raghavan, V.V.: Dynamic pre-fetching of views based on user-access patterns in an OLAP system. In: ICEIS, vol. 1, pp. 60–67 (2005) Ramachandran, K., Shah, B., Raghavan, V.V.: Dynamic pre-fetching of views based on user-access patterns in an OLAP system. In: ICEIS, vol. 1, pp. 60–67 (2005)
16.
go back to reference Sellam, T., Kersten, M.: Cluster-driven navigation of the query space. IEEE Trans. Knowl. Data Eng. 28(5), 1118–1131 (2016)CrossRef Sellam, T., Kersten, M.: Cluster-driven navigation of the query space. IEEE Trans. Knowl. Data Eng. 28(5), 1118–1131 (2016)CrossRef
17.
go back to reference Singh, V., Gray, J., Thakar, A., Szalay, A.S., Raddick, J., Boroski, B., Lebedeva, S., Yanny, B.: Skyserver traffic report-the first five years. arXiv preprint cs/0701173 (2007) Singh, V., Gray, J., Thakar, A., Szalay, A.S., Raddick, J., Boroski, B., Lebedeva, S., Yanny, B.: Skyserver traffic report-the first five years. arXiv preprint cs/0701173 (2007)
18.
go back to reference Tauheed, F., Heinis, T., Schürmann, F., Markram, H., Ailamaki, A.: SCOUT: prefetching for latent structure following queries. Proc. VLDB Endow. 5(11), 1531–1542 (2012)CrossRef Tauheed, F., Heinis, T., Schürmann, F., Markram, H., Ailamaki, A.: SCOUT: prefetching for latent structure following queries. Proc. VLDB Endow. 5(11), 1531–1542 (2012)CrossRef
19.
go back to reference Zhang, J., Chen, C., Vogeley, M.S., Pan, D., Thakar, A., Raddick, J.: SDSS log viewer: visual exploratory analysis of large-volume SQL log data. In: Visualization and Data Analysis 2012, vol. 8294, pp. 82940D. International Society for Optics and Photonics (2012) Zhang, J., Chen, C., Vogeley, M.S., Pan, D., Thakar, A., Raddick, J.: SDSS log viewer: visual exploratory analysis of large-volume SQL log data. In: Visualization and Data Analysis 2012, vol. 8294, pp. 82940D. International Society for Optics and Photonics (2012)
Metadata
Title
iExplore: Accelerating Exploratory Data Analysis by Predicting User Intention
Authors
Zhihui Yang
Jiyang Gong
Chaoying Liu
Yinan Jing
Zhenying He
Kai Zhang
X. Sean Wang
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-91458-9_9

Premium Partner