Skip to main content

2015 | OriginalPaper | Buchkapitel

Dynamic Time Warping Distance for Message Propagation Classification in Twitter

verfasst von : Siwar Jendoubi, Arnaud Martin, Ludovic Liétard, Boutheina Ben Yaghlane, Hend Ben Hadji

Erschienen in: Symbolic and Quantitative Approaches to Reasoning with Uncertainty

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Social messages classification is a research domain that has attracted the attention of many researchers in these last years. Indeed, the social message is different from ordinary text because it has some special characteristics like its shortness. Then the development of new approaches for the processing of the social message is now essential to make its classification more efficient. In this paper, we are mainly interested in the classification of social messages based on their spreading on online social networks (OSN). We proposed a new distance metric based on the Dynamic Time Warping distance and we use it with the probabilistic and the evidential k Nearest Neighbors (k-NN) classifiers to classify propagation networks (PrNets) of messages. The propagation network is a directed acyclic graph (DAG) that is used to record propagation traces of the message, the traversed links and their types. We tested the proposed metric with the chosen k-NN classifiers on real world propagation traces that were collected from Twitter social network and we got good classification accuracies.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
We call propagation network the network that conserves propagation traces of the message, i.e. traversed links and nodes.
 
2
We call propagation level the number of links between the source of the message and the target node.
 
3
Directed Acyclic Graph.
 
4
Twitter4j is a java library for the Twitter API, it is an open-sourced software and free of charge and it was created by Yusuke Yamamoto. More details can be found in http://​twitter4j.​org/​en/​index.​html.
 
Literatur
1.
Zurück zum Zitat Aregui, A., Denœux, T.: Fusion of one-class classifiers in the belief function framework. In: Proceedings of FUSION, Québec, Canada, Juillet 2007 Aregui, A., Denœux, T.: Fusion of one-class classifiers in the belief function framework. In: Proceedings of FUSION, Québec, Canada, Juillet 2007
2.
Zurück zum Zitat Aregui, A., Denoeux, T.: Constructing consonant belief functions from sample data using confidence sets of pignistic probabilities. Int. J. Approximate Reasoning 49(3), 575–594 (2008)MathSciNetCrossRefMATH Aregui, A., Denoeux, T.: Constructing consonant belief functions from sample data using confidence sets of pignistic probabilities. Int. J. Approximate Reasoning 49(3), 575–594 (2008)MathSciNetCrossRefMATH
3.
Zurück zum Zitat Banerjee, S., Ramanathan, K., Gupta, A.: Clustering short texts using wikipedia. In: Proceedings of ACM SIGIR Conference, pp. 787–788. ACM (2007) Banerjee, S., Ramanathan, K., Gupta, A.: Clustering short texts using wikipedia. In: Proceedings of ACM SIGIR Conference, pp. 787–788. ACM (2007)
4.
Zurück zum Zitat Ben Jabeur, L.: Leveraging social relevance: Using social networks to enhance literature access and microblog search. Ph.D. thesis, Université Toulouse 3 Paul Sabatier (UT3 Paul Sabatier), October 2013 Ben Jabeur, L.: Leveraging social relevance: Using social networks to enhance literature access and microblog search. Ph.D. thesis, Université Toulouse 3 Paul Sabatier (UT3 Paul Sabatier), October 2013
5.
Zurück zum Zitat Bhatia, N.: Vandana: Survey of nearest neighbor techniques. IJCSIS 8(2), 302–305 (2010) Bhatia, N.: Vandana: Survey of nearest neighbor techniques. IJCSIS 8(2), 302–305 (2010)
6.
Zurück zum Zitat Bunke, H., Foggia, P., Guidobaldi, C., Sansone, C., Vento, M.: A comparison of algorithms for maximum common subgraph on randomly connected graphs. In: Caelli, T., Amin, A., Duin, R.P.W., de Ridder, D., Kamel, M. (eds.) SSSPR 2002. LNCS, vol. 2396, pp. 123–132. Springer, Heidelberg (2002)CrossRef Bunke, H., Foggia, P., Guidobaldi, C., Sansone, C., Vento, M.: A comparison of algorithms for maximum common subgraph on randomly connected graphs. In: Caelli, T., Amin, A., Duin, R.P.W., de Ridder, D., Kamel, M. (eds.) SSSPR 2002. LNCS, vol. 2396, pp. 123–132. Springer, Heidelberg (2002)CrossRef
7.
8.
Zurück zum Zitat Denœux, T.: A \(k\)-nearest neighbor classification rule based on dempster-shafer theory. IEEE Trans. Syst., Man, Cybern.- Part A: Syst. Hum. 25(5), 804–813 (1995) Denœux, T.: A \(k\)-nearest neighbor classification rule based on dempster-shafer theory. IEEE Trans. Syst., Man, Cybern.- Part A: Syst. Hum. 25(5), 804–813 (1995)
9.
Zurück zum Zitat Gao, X., Xiao, B., Tao, D., Li, X.: A survey of graph edit distance. Int. J. Future Comput. Commun. 13(1), 113–129 (2010)MathSciNet Gao, X., Xiao, B., Tao, D., Li, X.: A survey of graph edit distance. Int. J. Future Comput. Commun. 13(1), 113–129 (2010)MathSciNet
10.
Zurück zum Zitat He, W., Zhab, S., Li, L.: Social media competitive analysis and text mining: A case study in the pizza industry. Int. J. Inf. Manage. 33, 464–472 (2013)CrossRef He, W., Zhab, S., Li, L.: Social media competitive analysis and text mining: A case study in the pizza industry. Int. J. Inf. Manage. 33, 464–472 (2013)CrossRef
11.
Zurück zum Zitat Hu, X., Sun, N., Zhang, C., Chua, T.S.: Exploiting internal and external semantics for the clustering of short texts using world knowledge. In: Proceedings of CIKM, pp. 919–928. ACM (2009) Hu, X., Sun, N., Zhang, C., Chua, T.S.: Exploiting internal and external semantics for the clustering of short texts using world knowledge. In: Proceedings of CIKM, pp. 919–928. ACM (2009)
12.
Zurück zum Zitat Jendoubi, S., Martin, A., Liétard, L., Ben Yaghlane, B.: Classification of message spreading in a heterogeneous social network. In: Laurent, A., Strauss, O., Bouchon-Meunier, B., Yager, R.R. (eds.) IPMU 2014, Part II. CCIS, vol. 443, pp. 66–75. Springer, Heidelberg (2014) Jendoubi, S., Martin, A., Liétard, L., Ben Yaghlane, B.: Classification of message spreading in a heterogeneous social network. In: Laurent, A., Strauss, O., Bouchon-Meunier, B., Yager, R.R. (eds.) IPMU 2014, Part II. CCIS, vol. 443, pp. 66–75. Springer, Heidelberg (2014)
13.
Zurück zum Zitat Lo, Y.W., Potdar, V.: A review of opinion mining and sentiment classification framework in social networks. In: Proceedings of DEST 2009, June 2009 Lo, Y.W., Potdar, V.: A review of opinion mining and sentiment classification framework in social networks. In: Proceedings of DEST 2009, June 2009
14.
Zurück zum Zitat Mostafa, M.M.: More than words: social networks text mining for consumer brand sentiments. Expert Syst. Appl. 40, 4241–4251 (2013)CrossRef Mostafa, M.M.: More than words: social networks text mining for consumer brand sentiments. Expert Syst. Appl. 40, 4241–4251 (2013)CrossRef
15.
Zurück zum Zitat Othman, M., Hassan, H., Moawad, R., El-Korany, A.: Opinion mining and sentimental analysis approaches: a survey. Life Sci. J. 11(4), 321–326 (2014) Othman, M., Hassan, H., Moawad, R., El-Korany, A.: Opinion mining and sentimental analysis approaches: a survey. Life Sci. J. 11(4), 321–326 (2014)
16.
Zurück zum Zitat Petitjean, F., Inglada, J., Gancarski, P.: Satellite image time series analysis under time warping. IEEE Trans. Geosci. Remote Sens. 50(8), 3081–3095 (2012)CrossRef Petitjean, F., Inglada, J., Gancarski, P.: Satellite image time series analysis under time warping. IEEE Trans. Geosci. Remote Sens. 50(8), 3081–3095 (2012)CrossRef
17.
Zurück zum Zitat Phan, X.H., Nguyen, L.M., Horiguchi, S.: Learning to classify short and sparse text and web with hidden topics from large-scale data collections. In: Proceedings of WWW 2009, pp. 91–100. ACM (2009) Phan, X.H., Nguyen, L.M., Horiguchi, S.: Learning to classify short and sparse text and web with hidden topics from large-scale data collections. In: Proceedings of WWW 2009, pp. 91–100. ACM (2009)
18.
Zurück zum Zitat Sakoe, H., Chiba, S.: A dynamic programming approach to continuous speech recognition. In: Proceedings of the Seventh International Congress on Acoustics, Budapest, vol. 3, pp. 65–69 (1971) Sakoe, H., Chiba, S.: A dynamic programming approach to continuous speech recognition. In: Proceedings of the Seventh International Congress on Acoustics, Budapest, vol. 3, pp. 65–69 (1971)
19.
Zurück zum Zitat Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002)CrossRef Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002)CrossRef
20.
Zurück zum Zitat Shafer, G.: A mathematical theory of evidence. Princeton University Press, Princeton (1976)MATH Shafer, G.: A mathematical theory of evidence. Princeton University Press, Princeton (1976)MATH
21.
Zurück zum Zitat Smets, P.: The Combination of evidence in the transferable belief model. IEEE Trans. Pattern Anal. Mach. Intell. 12(5), 447–458 (1990)CrossRef Smets, P.: The Combination of evidence in the transferable belief model. IEEE Trans. Pattern Anal. Mach. Intell. 12(5), 447–458 (1990)CrossRef
22.
Zurück zum Zitat Smets, P.: Belief functions: the disjunctive rule of combination and the generalized bayesian theorem. Int. J. Approximate Reasoning 9, 1–35 (1993)MathSciNetCrossRefMATH Smets, P.: Belief functions: the disjunctive rule of combination and the generalized bayesian theorem. Int. J. Approximate Reasoning 9, 1–35 (1993)MathSciNetCrossRefMATH
23.
Zurück zum Zitat Smets, P.: Data fusion in the tranferable belief model. In: Proceedings of FUSION, Paris, France, vol. 1, pp. 21–33, (2000) Smets, P.: Data fusion in the tranferable belief model. In: Proceedings of FUSION, Paris, France, vol. 1, pp. 21–33, (2000)
24.
Zurück zum Zitat Smets, P.: Decision making in the TBM: the necessity of the pignistic transformation. Int. J. Approximate Reasonning 38, 133–147 (2005)MathSciNetCrossRefMATH Smets, P.: Decision making in the TBM: the necessity of the pignistic transformation. Int. J. Approximate Reasonning 38, 133–147 (2005)MathSciNetCrossRefMATH
25.
Zurück zum Zitat Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., Demirbas, M.: Short text classification in twitter to improve information filtering. In: Proceedings of ACM SIGIR, pp. 841–842. ACM (2010) Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., Demirbas, M.: Short text classification in twitter to improve information filtering. In: Proceedings of ACM SIGIR, pp. 841–842. ACM (2010)
Metadaten
Titel
Dynamic Time Warping Distance for Message Propagation Classification in Twitter
verfasst von
Siwar Jendoubi
Arnaud Martin
Ludovic Liétard
Boutheina Ben Yaghlane
Hend Ben Hadji
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-20807-7_38