Skip to main content
Top
Published in: Cluster Computing 2/2019

03-01-2018

Multi-classification cluster analysis of large data based on knowledge element in microblogging short text

Authors: Wen Aihong, Yan Nan, Xu Caocao

Published in: Cluster Computing | Special Issue 2/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In order to improve performance of personalized label recommendation, the recommendation algorithm of implied relation topic model (RTM) for microblog personalized label based on Gibbs sampling inference is proposed. Firstly, imaging form is used to express potential local information in microblog and to conduct top-k similar user discovery for users represented as user topic distribution, and then the frequency of all labels in these users is calculated to recommend the label mostly related to the users. Secondly, to dig potential topic information, enhancement cosine similarity RTM model with penalty term is used to name the microblog label, which greatly improves the influence of joint modeling on potential topic generation label, and the relationship between overall label and topic can be found; finally, it can be seen from real experimental result that the proposed recommendation method is superior to the selected TF–IDF, RTMSA and other classic label recommendation algorithm, so as to verify effectiveness of the algorithm.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Hsu, H.M., Zeng, W.S., Hung, C.S., et al.: Frame dispatcher: a multi-frame classification system for social movement by using microblogging data. In: IEEE/WIC/ACM International Conference on Web Intelligence, pp. 588–591. IEEE (2016) Hsu, H.M., Zeng, W.S., Hung, C.S., et al.: Frame dispatcher: a multi-frame classification system for social movement by using microblogging data. In: IEEE/WIC/ACM International Conference on Web Intelligence, pp. 588–591. IEEE (2016)
2.
go back to reference Luo, Z., Wu, X., Cai, W., et al.: Examining multi-factor interactions in microblogging based on log-linear modeling. In: IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp. 189–193. ACM (2012) Luo, Z., Wu, X., Cai, W., et al.: Examining multi-factor interactions in microblogging based on log-linear modeling. In: IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp. 189–193. ACM (2012)
3.
go back to reference Yang, J., Li, Q., Zhuang, Y.: OCTOPUS: aggressive search of multi-modality data using multifaceted knowledge base. In: International Conference on World Wide Web, pp. 54–64. ACM (2002) Yang, J., Li, Q., Zhuang, Y.: OCTOPUS: aggressive search of multi-modality data using multifaceted knowledge base. In: International Conference on World Wide Web, pp. 54–64. ACM (2002)
4.
go back to reference Shi, W., Wang, H., He, S.: Sentiment analysis of Chinese microblogging based on sentiment ontology: a case study of ‘7.23 Wenzhou Train Collision. Connect. Sci. 25(4), 161–178 (2013) Shi, W., Wang, H., He, S.: Sentiment analysis of Chinese microblogging based on sentiment ontology: a case study of ‘7.23 Wenzhou Train Collision. Connect. Sci. 25(4), 161–178 (2013)
5.
go back to reference Zhang, Y., Shang, L., Jia, X.: Sentiment analysis on microblogging by integrating text and image features. In: Advances in Knowledge Discovery and Data Mining, pp. 52–63. Springer International Publishing (2015) Zhang, Y., Shang, L., Jia, X.: Sentiment analysis on microblogging by integrating text and image features. In: Advances in Knowledge Discovery and Data Mining, pp. 52–63. Springer International Publishing (2015)
6.
go back to reference Lu, J.Y., Zuo, W.L., Zhu, L.: Recognizing event in short text based on decision tree. Appl. Mech. Mater. 571–572, 237–240 (2014) Lu, J.Y., Zuo, W.L., Zhu, L.: Recognizing event in short text based on decision tree. Appl. Mech. Mater. 571–572, 237–240 (2014)
7.
go back to reference Sriram, B., Fuhry, D., Demir, E., et al.: Short text classification in twitter to improve information filtering. In: Proceeding of the, International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010, Geneva, Switzerland, July, pp. 841–842. DBLP (2010) Sriram, B., Fuhry, D., Demir, E., et al.: Short text classification in twitter to improve information filtering. In: Proceeding of the, International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010, Geneva, Switzerland, July, pp. 841–842. DBLP (2010)
8.
go back to reference Sakaki, T., Okazaki, M., Matsuo, Y.: Tweet analysis for real-time event detection and earthquake reporting system development. IEEE Trans. Knowl. Data Eng. 25(4), 919–931 (2013) Sakaki, T., Okazaki, M., Matsuo, Y.: Tweet analysis for real-time event detection and earthquake reporting system development. IEEE Trans. Knowl. Data Eng. 25(4), 919–931 (2013)
9.
go back to reference Pang, G., Jin, H., Jiang, S.: CenKNN: a scalable and effective text classifier. Data Min. Knowl. Discov. 29(3), 593–625 (2015) Pang, G., Jin, H., Jiang, S.: CenKNN: a scalable and effective text classifier. Data Min. Knowl. Discov. 29(3), 593–625 (2015)
10.
go back to reference Qin, Y., Sheng, Q.Z., Falkner, N.J.G., et al.: When things matter: a data-centric view of the Internet of things. Comput. Sci. (2014) Qin, Y., Sheng, Q.Z., Falkner, N.J.G., et al.: When things matter: a data-centric view of the Internet of things. Comput. Sci. (2014)
11.
go back to reference Bell, D., Koulouri, T., Lauria, S., et al.: Microblogging as a mechanism for human-robot interaction. Knowl.-Based Syst. 2014, 64–77 (2014) Bell, D., Koulouri, T., Lauria, S., et al.: Microblogging as a mechanism for human-robot interaction. Knowl.-Based Syst. 2014, 64–77 (2014)
12.
go back to reference Chan, C.C., Liszka, K.J.: Application of rough set theory to sentiment analysis of microblog data. In: Rough Sets and Intelligent Systems—Professor Zdzisław Pawlak in Memoriam, pp. 185–202. Springer, Berlin (2013) Chan, C.C., Liszka, K.J.: Application of rough set theory to sentiment analysis of microblog data. In: Rough Sets and Intelligent Systems—Professor Zdzisław Pawlak in Memoriam, pp. 185–202. Springer, Berlin (2013)
13.
go back to reference Liu, H.C., Wang, J.H.: Social influence estimation for short texts in plurk. In: International Conference on Advances in Social Networks Analysis and Mining, pp. 1012–1017. IEEE Computer Society (2012) Liu, H.C., Wang, J.H.: Social influence estimation for short texts in plurk. In: International Conference on Advances in Social Networks Analysis and Mining, pp. 1012–1017. IEEE Computer Society (2012)
14.
go back to reference Pennacchiotti, M., Popescu, A.M.: Democrats, republicans and starbucks afficionados: user classification in twitter. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, August, pp. 430–438. DBLP (2011) Pennacchiotti, M., Popescu, A.M.: Democrats, republicans and starbucks afficionados: user classification in twitter. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, August, pp. 430–438. DBLP (2011)
15.
go back to reference Davidov, D., Tsur, O., Rappoport, A.: Enhanced sentiment learning using twitter Hashtags and Smileys. In: COLING 2010, International Conference on Computational Linguistics, Posters Volume, 23–27 August 2010, Beijing, China, pp. 241–249. DBLP (2010) Davidov, D., Tsur, O., Rappoport, A.: Enhanced sentiment learning using twitter Hashtags and Smileys. In: COLING 2010, International Conference on Computational Linguistics, Posters Volume, 23–27 August 2010, Beijing, China, pp. 241–249. DBLP (2010)
16.
go back to reference Arunkumar, N., Ramkumar, K., Venkatraman, V., Abdulhay, E., Fernandes, S.L., Kadry, S., Segal, S.: Classification of focal and non focal EEG using entropies. Pattern Recognit. Lett. 94, 112–117 (2017) Arunkumar, N., Ramkumar, K., Venkatraman, V., Abdulhay, E., Fernandes, S.L., Kadry, S., Segal, S.: Classification of focal and non focal EEG using entropies. Pattern Recognit. Lett. 94, 112–117 (2017)
17.
go back to reference Arunkumar, N., Kumar, K.R., Venkataraman, V.: Automatic detection of epileptic seizures using new entropy measures. J. Med. Imaging Health Inf. 6(3), 724–730 (2016) Arunkumar, N., Kumar, K.R., Venkataraman, V.: Automatic detection of epileptic seizures using new entropy measures. J. Med. Imaging Health Inf. 6(3), 724–730 (2016)
18.
go back to reference Stephygraph, L.R., Arunkumar, N.: Brain-actuated wireless mobile robot control through an adaptive human-machine interface. Adv. Intell. Syst. Comput. 397, 537–549 (2016) Stephygraph, L.R., Arunkumar, N.: Brain-actuated wireless mobile robot control through an adaptive human-machine interface. Adv. Intell. Syst. Comput. 397, 537–549 (2016)
Metadata
Title
Multi-classification cluster analysis of large data based on knowledge element in microblogging short text
Authors
Wen Aihong
Yan Nan
Xu Caocao
Publication date
03-01-2018
Publisher
Springer US
Published in
Cluster Computing / Issue Special Issue 2/2019
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-017-1517-9

Other articles of this Special Issue 2/2019

Cluster Computing 2/2019 Go to the issue

Premium Partner