Skip to main content
Top

2019 | OriginalPaper | Chapter

A Methodology for Resolving Heterogeneity and Interdependence in Data Analytics

Authors : Han Han, Yunwei Zhao, Can Wang, Min Shu, Tao Peng, Chi-Hung Chi, Yonghong Yu

Published in: Advanced Data Mining and Applications

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The big data analytics achieves wide application in a number of areas due to its capability in uncovering hidden patterns, correlations and insights through integrating multiple data sources. However, the interdependence and heterogeneity features of these data sources pose a big challenge in managing these data sources to support “last mile” analytics in decision making and value co-creation which are usually with multiple perspectives and at multiple granularities. In this paper, we propose a unified knowledge representation framework, namely, Cyber-Entity (Cyber-E) modeling, to capture and formalize selected behaviors of real entities in both the social and physical worlds to the cyber analytic space. Its special features include not only the stateful, intra- properties of a Cyber-E, but also the inter-relationship and dependence among them. A grouping mechanism, called Cyber-G, is also introduced to support flexible granularity adjustment in the knowledge management. It supports rapid on-demand self-service analytics. An illustrating example of applying this approach in academic research community is given, followed by a case study of two top conferences in service computing area– ICSOC and ICWS– to illustrate the effectiveness and potentials of our approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
The CAF can take multiple inputs and gives one single output. More specifically, we have (a) the input of caf can be raw data, can also be property output of the same or other CAF, (b) different CAF can share the same input, (c) an algorithm of multiple outputs could be decomposed into multiple single-output algorithms. Correspondingly, the number of the input arrows could be 1 or many, while the number of output arrows could only be 1.
 
2
There are two situations for the output of a potential inter-group CAF: (i) a property of a Cyber-G, or (ii) a property of a Cyber-E which belongs to certain Cyber-G. Suppose \(GS\ne \emptyset \), for each situation, the definition is given in Definition 10
 
6
Due to data limitations, the propagation through the relational properties (i.e., “Published In Venue”, “Cited By Author”, “Cited By Paper”) is broken as illustrated by line \(l_1\) and \(l_2\), as shown in Fig. 2.
 
Literature
1.
go back to reference Lustig, I., Dietrich, B., et al.: The analytics journey. Analytics Mag. (2010) Lustig, I., Dietrich, B., et al.: The analytics journey. Analytics Mag. (2010)
3.
go back to reference Miller, G.: Social scientists wade into the tweet stream. Science 333(6051), 1814–1815 (2011)CrossRef Miller, G.: Social scientists wade into the tweet stream. Science 333(6051), 1814–1815 (2011)CrossRef
4.
go back to reference Johan, B., Huina, M.: Twitter mood as a stock market predictor. IEEE Comput. 44(10), 91–94 (2011)CrossRef Johan, B., Huina, M.: Twitter mood as a stock market predictor. IEEE Comput. 44(10), 91–94 (2011)CrossRef
5.
go back to reference Kenny, D.A., Cook, W.L.: Dyadic Data Analysis. The Guilford Press, New York (2006) Kenny, D.A., Cook, W.L.: Dyadic Data Analysis. The Guilford Press, New York (2006)
6.
go back to reference Brachman, R., Levesque, H.: Knowledge Representation and Reasoning. Morgan Kaufmann, San Francisco (2004)MATH Brachman, R., Levesque, H.: Knowledge Representation and Reasoning. Morgan Kaufmann, San Francisco (2004)MATH
7.
go back to reference Zhang, D., Guo, B., Yu, Z.: The emergence of social and community intelligence. IEEE Comput. 44(7), 21–28 (2011)CrossRef Zhang, D., Guo, B., Yu, Z.: The emergence of social and community intelligence. IEEE Comput. 44(7), 21–28 (2011)CrossRef
8.
go back to reference Bergstrom, C.: Eigenfactor: measuring the value and prestige of scholarly journals. College Res. Libr. News 68(5), 314–316 (2007)CrossRef Bergstrom, C.: Eigenfactor: measuring the value and prestige of scholarly journals. College Res. Libr. News 68(5), 314–316 (2007)CrossRef
9.
go back to reference Cheang, B., Chu, S., et al.: A multidimensional approach to evaluating management journals: refining pagerank via the differentiation of citation types and identifying the roles that management journals play. J. Am. Soc. Inform. Sci. Technol. 65(12), 2581–2591 (2014)CrossRef Cheang, B., Chu, S., et al.: A multidimensional approach to evaluating management journals: refining pagerank via the differentiation of citation types and identifying the roles that management journals play. J. Am. Soc. Inform. Sci. Technol. 65(12), 2581–2591 (2014)CrossRef
10.
go back to reference Bollen, J., Rodriguez, M.A., et al.: Journal status. Scientometrics 69(3), 669–687 (2006)CrossRef Bollen, J., Rodriguez, M.A., et al.: Journal status. Scientometrics 69(3), 669–687 (2006)CrossRef
11.
go back to reference Alonso, S., Cabrerizo, F.J., et al.: h-index: a review focused in its variants, computation and standardization for different scientific fields. J. Inf. 3(4), 273–289 (2009) Alonso, S., Cabrerizo, F.J., et al.: h-index: a review focused in its variants, computation and standardization for different scientific fields. J. Inf. 3(4), 273–289 (2009)
12.
go back to reference Guerrero-Bote, V.P., Moya-Anegon, F.: Relationship between downloads and citations at journal and paper levels, and the influence of language. Scientometrics 101(2), 1043–1065 (2014)CrossRef Guerrero-Bote, V.P., Moya-Anegon, F.: Relationship between downloads and citations at journal and paper levels, and the influence of language. Scientometrics 101(2), 1043–1065 (2014)CrossRef
13.
go back to reference Aduku, K.J., ThelWall, M., et al.: Do Mendeley reader counts reflect the scholarly impact of conference papers? An investigation of computer science and engineering. Scientometrics 112(1), 1–9 (2017)CrossRef Aduku, K.J., ThelWall, M., et al.: Do Mendeley reader counts reflect the scholarly impact of conference papers? An investigation of computer science and engineering. Scientometrics 112(1), 1–9 (2017)CrossRef
14.
go back to reference Zhuang, Z., Elmacioglu, E., et al.: Measuring conference quality by mining program committee characteristics. In: Proceedings of the 7th ACM/IEEE-CS Joint Conference on Digital Libraries, Vancouver, BC, Canada (2007) Zhuang, Z., Elmacioglu, E., et al.: Measuring conference quality by mining program committee characteristics. In: Proceedings of the 7th ACM/IEEE-CS Joint Conference on Digital Libraries, Vancouver, BC, Canada (2007)
15.
go back to reference Yan, E., Ding, Y.: Discovering author impact: a PageRank perspective. Inf. Process. Manage. 47(1), 125–134 (2011)CrossRef Yan, E., Ding, Y.: Discovering author impact: a PageRank perspective. Inf. Process. Manage. 47(1), 125–134 (2011)CrossRef
17.
go back to reference Ma, N., Guan, J., et al.: Bringing PageRank to the citation analysis. Inf. Process. Manage. 44(2), 800–810 (2008)MathSciNetCrossRef Ma, N., Guan, J., et al.: Bringing PageRank to the citation analysis. Inf. Process. Manage. 44(2), 800–810 (2008)MathSciNetCrossRef
18.
go back to reference Yan, E., Ding, Y., et al.: P-rank: an indicator measuring prestige in heterogeneous scholarly networks. J. Am. Soc. Inform. Sci. Technol. 62(3), 467–477 (2011) Yan, E., Ding, Y., et al.: P-rank: an indicator measuring prestige in heterogeneous scholarly networks. J. Am. Soc. Inform. Sci. Technol. 62(3), 467–477 (2011)
19.
go back to reference Mu, D., Guo, L., et al.: Query-focused personalized citation recommendation with mutually reinforced rankingk. IEEE Access, 3107–3119 (2018)CrossRef Mu, D., Guo, L., et al.: Query-focused personalized citation recommendation with mutually reinforced rankingk. IEEE Access, 3107–3119 (2018)CrossRef
20.
go back to reference Liu, Z., Huang, H., et al.: Tri-rank: an authority ranking framework in heterogeneous academic networks by mutual reinforce. In: 2014 IEEE 26th International Conference on Tools with Artificial Intelligence, pp. 493–500 (2014) Liu, Z., Huang, H., et al.: Tri-rank: an authority ranking framework in heterogeneous academic networks by mutual reinforce. In: 2014 IEEE 26th International Conference on Tools with Artificial Intelligence, pp. 493–500 (2014)
21.
go back to reference Guerrero-Bote, V.P., Moya-Anegón, F.: A further step forward in measuring journals’ scientific prestige: the SJR2 indicator. J. Inf. 6(4), 674–688 (2012) Guerrero-Bote, V.P., Moya-Anegón, F.: A further step forward in measuring journals’ scientific prestige: the SJR2 indicator. J. Inf. 6(4), 674–688 (2012)
Metadata
Title
A Methodology for Resolving Heterogeneity and Interdependence in Data Analytics
Authors
Han Han
Yunwei Zhao
Can Wang
Min Shu
Tao Peng
Chi-Hung Chi
Yonghong Yu
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-35231-8_2

Premium Partner