Skip to main content
Erschienen in: Arabian Journal for Science and Engineering 4/2020

15.10.2019 | Research Article - Computer Engineering and Computer Science

A Novel Short Text Clustering Model Based on Grey System Theory

verfasst von: Hüseyin Fidan, Mehmet Erkan Yuksel

Erschienen in: Arabian Journal for Science and Engineering | Ausgabe 4/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Short text clustering has great challenges due to the structural reasons, especially when applied to small datasets. Limited number of words leads to a poor-quality feature vector, low clustering accuracy, and failure of analysis. Although some approaches have been observed in the related literature, there is still no agreement on an efficient solution. On the other hand, the Grey system theory, which gives better results in numerical analyses with insufficient data, has not yet been applied to short text clustering. The purpose of our study is to develop a short text clustering model based on Grey system theory applicable to small datasets. In order to measure the efficiency of our method, book reviews labeled as negative or positive were obtained from Amazon.com dataset collections, and small datasets have been created. The Grey relational clustering as well as hierarchical and partitional algorithms has been applied to the small datasets separately. According to the results, our model has better accuracy values than the other algorithms in clustering of small datasets containing short text. Consequently, we demonstrated that the Grey relational clustering should be applied to short text clustering for much better results.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Abbas, O.A.: Comparisons between data clustering algorithms. Int. Arab. J. Inf. Technol. 5(3), 320–325 (2008) Abbas, O.A.: Comparisons between data clustering algorithms. Int. Arab. J. Inf. Technol. 5(3), 320–325 (2008)
2.
Zurück zum Zitat Tajunisha, N.; Saravanan, V.: Performance analysis of K-means with different initialization methods for high dimensional data. Int. J. Artif. Intell. Appl. 1(4), 44–52 (2010) Tajunisha, N.; Saravanan, V.: Performance analysis of K-means with different initialization methods for high dimensional data. Int. J. Artif. Intell. Appl. 1(4), 44–52 (2010)
3.
Zurück zum Zitat Celebi, M.E.: Improving the performance of k-means for color quantization. Image Vis. Comput. 29, 260–271 (2011) Celebi, M.E.: Improving the performance of k-means for color quantization. Image Vis. Comput. 29, 260–271 (2011)
4.
Zurück zum Zitat Jun, S.; Park, S.S.; Jang, D.S.: Document clustering method using dimension reduction and support vector clustering to overcome sparseness. Expert Syst. Appl. 41, 3204–3212 (2014) Jun, S.; Park, S.S.; Jang, D.S.: Document clustering method using dimension reduction and support vector clustering to overcome sparseness. Expert Syst. Appl. 41, 3204–3212 (2014)
5.
Zurück zum Zitat Fidan, H.: E-ticaret Müşteri Bağlılığı Gri İlişkisel Kümeleme Analizi. AJIT-e Online Acad. J. Inf. Technol. 9(32), 163–182 (2018) Fidan, H.: E-ticaret Müşteri Bağlılığı Gri İlişkisel Kümeleme Analizi. AJIT-e Online Acad. J. Inf. Technol. 9(32), 163–182 (2018)
6.
Zurück zum Zitat Tuzhilin, A.: Customer relationship management and web mining: the next frontier. Data Min. Knowl. Discov. 24(3), 584–612 (2012) Tuzhilin, A.: Customer relationship management and web mining: the next frontier. Data Min. Knowl. Discov. 24(3), 584–612 (2012)
7.
Zurück zum Zitat Allahyari, M.; Pouriyeh, S.; Assefi, M.; Safaei, S.; Trippe, E.D.; Gutierrez, J.B.; Kochut, K.: Brief survey of text mining: classification, clustering and extraction techniques. In: KDD Bigdas Canada (2017) Allahyari, M.; Pouriyeh, S.; Assefi, M.; Safaei, S.; Trippe, E.D.; Gutierrez, J.B.; Kochut, K.: Brief survey of text mining: classification, clustering and extraction techniques. In: KDD Bigdas Canada (2017)
8.
Zurück zum Zitat Hebrail G.; Marsais J.: Experiments of textual data analysis at electricite de France. In: IFCS- 92 of the International Federation of Classification Societies, pp. 569–576 (1992) Hebrail G.; Marsais J.: Experiments of textual data analysis at electricite de France. In: IFCS- 92 of the International Federation of Classification Societies, pp. 569–576 (1992)
9.
Zurück zum Zitat Feldman, R.; Dagan, I.: Knowledge discovery in textual databases (KDT). In: KDD-95, pp. 112–117 (1995) Feldman, R.; Dagan, I.: Knowledge discovery in textual databases (KDT). In: KDD-95, pp. 112–117 (1995)
10.
Zurück zum Zitat Harish, B.S.; Guru, D.S.; Manjunath, S.: Representation and classification of text documents: a brief review. In: IJCA Special Issue on “Recent Trends in Image Processing and Pattern Recognition, RTIPPR (2010) Harish, B.S.; Guru, D.S.; Manjunath, S.: Representation and classification of text documents: a brief review. In: IJCA Special Issue on “Recent Trends in Image Processing and Pattern Recognition, RTIPPR (2010)
11.
Zurück zum Zitat Beliga, S.; Mestrovic, A.; Ipsic, M.S.: An overview of graph based keyword extraction methods and approaches. J. Inf. Organ. Sci. 39(1), 1–20 (2015) Beliga, S.; Mestrovic, A.; Ipsic, M.S.: An overview of graph based keyword extraction methods and approaches. J. Inf. Organ. Sci. 39(1), 1–20 (2015)
12.
Zurück zum Zitat Han, J.; Kamber, M.; Pei, J.: Data Mining Concepts and Techniques. Morgan Kaufmann Publications, Burlington (2012)MATH Han, J.; Kamber, M.; Pei, J.: Data Mining Concepts and Techniques. Morgan Kaufmann Publications, Burlington (2012)MATH
13.
Zurück zum Zitat Ravi, K.; Ravi, R.: A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl. Based Syst. 89, 14–46 (2015) Ravi, K.; Ravi, R.: A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl. Based Syst. 89, 14–46 (2015)
14.
Zurück zum Zitat Montoyo, A.; Barco, P.M.; Balahur, A.: Subjectivity and sentiment analysis: an overview of the current state of the area and envisaged developments. Decis. Support Syst. 53, 675–679 (2012) Montoyo, A.; Barco, P.M.; Balahur, A.: Subjectivity and sentiment analysis: an overview of the current state of the area and envisaged developments. Decis. Support Syst. 53, 675–679 (2012)
15.
Zurück zum Zitat Salton, G.: A vector space model for automatic indexing. Inf. Retr. Lang. Process. 18(11), 613–620 (1975)MathSciNetMATH Salton, G.: A vector space model for automatic indexing. Inf. Retr. Lang. Process. 18(11), 613–620 (1975)MathSciNetMATH
16.
Zurück zum Zitat Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002) Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002)
17.
Zurück zum Zitat Kim, H.K.; Kim, H.; Cho, S.: Bag-of-concepts: comprehending document representation through clustering words in distributed representation. Neurocomputing 266(29), 336–352 (2017) Kim, H.K.; Kim, H.; Cho, S.: Bag-of-concepts: comprehending document representation through clustering words in distributed representation. Neurocomputing 266(29), 336–352 (2017)
18.
Zurück zum Zitat Xia, T.; Chai, Y.: An improvement to TF-IDF: term distribution based term weight algorithm. J. Softw. 6(3), 413–420 (2011) Xia, T.; Chai, Y.: An improvement to TF-IDF: term distribution based term weight algorithm. J. Softw. 6(3), 413–420 (2011)
19.
Zurück zum Zitat Zhang, K.; Narayanan, R.; Choudhary, A.: Voice of the customers: mining online customer reviews for product feature-based ranking. Workshop on Online Social Networks (2010) Zhang, K.; Narayanan, R.; Choudhary, A.: Voice of the customers: mining online customer reviews for product feature-based ranking. Workshop on Online Social Networks (2010)
20.
Zurück zum Zitat Tala, F.: A study of stemming effects on information retrieval in Bahasa Indonesia. Master thesis, Institute for Logic, Language and Computation, University of Amsterdam (2003) Tala, F.: A study of stemming effects on information retrieval in Bahasa Indonesia. Master thesis, Institute for Logic, Language and Computation, University of Amsterdam (2003)
21.
Zurück zum Zitat Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recognit. Lett. 31(8), 651–666 (2010) Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recognit. Lett. 31(8), 651–666 (2010)
22.
Zurück zum Zitat Luhn, H.O.: The automatic creation of literature abstracts. IBM J. Res. Dev. 2, 159–165 (1958)MathSciNet Luhn, H.O.: The automatic creation of literature abstracts. IBM J. Res. Dev. 2, 159–165 (1958)MathSciNet
23.
Zurück zum Zitat Willett, P.: Recent trends in hierarchical document clustering: a critical review. Inf. Process. Manag. 24(5), 577–597 (1988) Willett, P.: Recent trends in hierarchical document clustering: a critical review. Inf. Process. Manag. 24(5), 577–597 (1988)
24.
Zurück zum Zitat Cutting, D.; Karger, D.R.; Pedersen, J.O.; Tukey, J.W.: Scatter/gather: a cluster-based approach to browsing large document collections. In: 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 318–329 (1992) Cutting, D.; Karger, D.R.; Pedersen, J.O.; Tukey, J.W.: Scatter/gather: a cluster-based approach to browsing large document collections. In: 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 318–329 (1992)
26.
Zurück zum Zitat Johnson, S.C.: Hierarchical clustering schemes. Psychometrika 2, 241–254 (1967)MATH Johnson, S.C.: Hierarchical clustering schemes. Psychometrika 2, 241–254 (1967)MATH
27.
Zurück zum Zitat Ghosh, S.; Dubey, S.K.: Comparative analysis of k-means and fuzzy c-means algorithms. Int. J. Adv. Comput. Sci. Appl. 4(4), 35–39 (2013) Ghosh, S.; Dubey, S.K.: Comparative analysis of k-means and fuzzy c-means algorithms. Int. J. Adv. Comput. Sci. Appl. 4(4), 35–39 (2013)
29.
Zurück zum Zitat Xu, R.; Wunsch, D.C.: Survey on clustering algorithms. IEEE Trans. Neural Netw. 16(3), 645–678 (2005) Xu, R.; Wunsch, D.C.: Survey on clustering algorithms. IEEE Trans. Neural Netw. 16(3), 645–678 (2005)
30.
Zurück zum Zitat Pelleg, D.; Moore, A.: X-means: extending k-means with efficient estimation of the number of clusters. In: 17th International Conference on Machine Learning (ICML’00), pp. 727–734 (2000) Pelleg, D.; Moore, A.: X-means: extending k-means with efficient estimation of the number of clusters. In: 17th International Conference on Machine Learning (ICML’00), pp. 727–734 (2000)
31.
Zurück zum Zitat Faguo, Z.; Fan, Z.; Bingru, Y.: Research on short text classification algorithm based on statistics and rules, In: Third International Symposium on Electronic Commerce and Security, pp. 3–79 (2010) Faguo, Z.; Fan, Z.; Bingru, Y.: Research on short text classification algorithm based on statistics and rules, In: Third International Symposium on Electronic Commerce and Security, pp. 3–79 (2010)
32.
Zurück zum Zitat Beleites, C.; Salzer, R.: Assessing and improving the stability of chemometric models in small sample size situations. Anal. Bioanal. Chem. 390, 1261–1271 (2008) Beleites, C.; Salzer, R.: Assessing and improving the stability of chemometric models in small sample size situations. Anal. Bioanal. Chem. 390, 1261–1271 (2008)
36.
Zurück zum Zitat Bafghi, E.P.: Clustering of customers based on shopping behavior and employing genetic algorithms. Eng. Technol. Appl. Sci. Res. 7(1), 1420–1424 (2017) Bafghi, E.P.: Clustering of customers based on shopping behavior and employing genetic algorithms. Eng. Technol. Appl. Sci. Res. 7(1), 1420–1424 (2017)
37.
Zurück zum Zitat Xu, J.; Wang, P.; Tian, G.; Xu, B.; Zhao, J.; Wang, F.; Hao, H.: Short text clustering via convolutional neural networks. In: NAACL-HLT, pp. 62–69 (2015) Xu, J.; Wang, P.; Tian, G.; Xu, B.; Zhao, J.; Wang, F.; Hao, H.: Short text clustering via convolutional neural networks. In: NAACL-HLT, pp. 62–69 (2015)
38.
Zurück zum Zitat Meila, M.; Heckerman, D.: An experimental comparison of model-based clustering methods. Mach. Learn. 42, 9–29 (2001)MATH Meila, M.; Heckerman, D.: An experimental comparison of model-based clustering methods. Mach. Learn. 42, 9–29 (2001)MATH
39.
Zurück zum Zitat Quan, X.; Liu, G.; Lu, Z.; Ni, X.; Liu, W.: Short text similarity based on probabilistic topics. Knowl. Inf. Syst. 25(3), 473–491 (2010) Quan, X.; Liu, G.; Lu, Z.; Ni, X.; Liu, W.: Short text similarity based on probabilistic topics. Knowl. Inf. Syst. 25(3), 473–491 (2010)
40.
Zurück zum Zitat Siddiqui, T.; Aalam, P.: Short text clustering; challenges and solutions: a literature review. Int. J. Math. Comput. Res. 3(6), 1025–1031 (2015) Siddiqui, T.; Aalam, P.: Short text clustering; challenges and solutions: a literature review. Int. J. Math. Comput. Res. 3(6), 1025–1031 (2015)
41.
Zurück zum Zitat Onan, A.; Korukoglu, S.; Bulut, H.: Ensemble of keyword extraction methods and classifiers in text classification. Expert Syst. Appl. 57, 232–247 (2016) Onan, A.; Korukoglu, S.; Bulut, H.: Ensemble of keyword extraction methods and classifiers in text classification. Expert Syst. Appl. 57, 232–247 (2016)
42.
Zurück zum Zitat Hotho, A.; Staab, S.; Stumme, G.: WordNet improves text document clustering. In: 26th Annual International ACM SIGIR Conference Semantic Web Workshop, pp. 541–5449 (2003) Hotho, A.; Staab, S.; Stumme, G.: WordNet improves text document clustering. In: 26th Annual International ACM SIGIR Conference Semantic Web Workshop, pp. 541–5449 (2003)
43.
Zurück zum Zitat Hu, X.; Zhang, X.; Lu, C.; Park, E.K.; Zhou, X.: Exploiting Wikipedia as external knowledge for document clustering. In: 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 389–396 (2009) Hu, X.; Zhang, X.; Lu, C.; Park, E.K.; Zhou, X.: Exploiting Wikipedia as external knowledge for document clustering. In: 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 389–396 (2009)
44.
Zurück zum Zitat Bollegala, D.; Matsuo, Y.; Ishizuka, M.: A web search engine-based approach to measure semantic similarity between words. IEEE Trans. Knowl. Data Eng. 23(7), 977–990 (2011) Bollegala, D.; Matsuo, Y.; Ishizuka, M.: A web search engine-based approach to measure semantic similarity between words. IEEE Trans. Knowl. Data Eng. 23(7), 977–990 (2011)
45.
Zurück zum Zitat Shami, M.; Heilman, T.D.: A web-based kernel function for measuring the similarity of short text snippets. In: 15th International Conference 2006 on World Wide Web, pp. 377–386 (2006) Shami, M.; Heilman, T.D.: A web-based kernel function for measuring the similarity of short text snippets. In: 15th International Conference 2006 on World Wide Web, pp. 377–386 (2006)
46.
Zurück zum Zitat Wang, J.; Zhou, Y.; Li, L.; Hu, B.; Hu, X.: Improving short text clustering performance with keyword expansion. In: Wang, H., Shen, Y., Huang, T., Zeng, Z. (eds.) The Sixth International Symposium on Neural Networks, Advances in Intelligent and Soft Computing, vol. 56, pp. 291–298. Springer, Berlin, Heidelberg (2009) Wang, J.; Zhou, Y.; Li, L.; Hu, B.; Hu, X.: Improving short text clustering performance with keyword expansion. In: Wang, H., Shen, Y., Huang, T., Zeng, Z. (eds.) The Sixth International Symposium on Neural Networks, Advances in Intelligent and Soft Computing, vol. 56, pp. 291–298. Springer, Berlin, Heidelberg (2009)
47.
Zurück zum Zitat Ni, X.; Quan, X.; Lu, Z.; Liu, W.; Hua, B.: Short text clustering by finding core terms. Knowl. Inf. Syst. 27(3), 345–365 (2011) Ni, X.; Quan, X.; Lu, Z.; Liu, W.; Hua, B.: Short text clustering by finding core terms. Knowl. Inf. Syst. 27(3), 345–365 (2011)
48.
Zurück zum Zitat Yin, J.; Wang, J.: A dirichlet multinomial mixture model-based approach for short text clustering. In: SIGKDD, pp. 233–242 (2014) Yin, J.; Wang, J.: A dirichlet multinomial mixture model-based approach for short text clustering. In: SIGKDD, pp. 233–242 (2014)
49.
Zurück zum Zitat Majumder, S.; Balaji, N.; Brey, K.; Fu, W.; Menzies, T.: 500 + times faster than deep learning (a case study exploring faster methods for text mining stackoverflow). In: Mining Software Repositories (MSR) IEEE/ACM 15th International Conference on ACM (2018) Majumder, S.; Balaji, N.; Brey, K.; Fu, W.; Menzies, T.: 500 + times faster than deep learning (a case study exploring faster methods for text mining stackoverflow). In: Mining Software Repositories (MSR) IEEE/ACM 15th International Conference on ACM (2018)
50.
Zurück zum Zitat Ye, M.; Zhang, P.; Nie, L.: Clustering sparse binary data with hierarchical Bayesian Bernoulli mixture model. Comput. Stat. Data Anal. 123, 32–49 (2018)MathSciNetMATH Ye, M.; Zhang, P.; Nie, L.: Clustering sparse binary data with hierarchical Bayesian Bernoulli mixture model. Comput. Stat. Data Anal. 123, 32–49 (2018)MathSciNetMATH
51.
Zurück zum Zitat Wong, C.C.; Chen, C.C.: Data clustering by grey relational analysis. J. Grey Syst. 10(3), 281–288 (1998) Wong, C.C.; Chen, C.C.: Data clustering by grey relational analysis. J. Grey Syst. 10(3), 281–288 (1998)
52.
Zurück zum Zitat Yeh, M.F.: Data clustering via grey relational pattern analysis. J. Grey Syst. 14(3), 259–264 (2002) Yeh, M.F.: Data clustering via grey relational pattern analysis. J. Grey Syst. 14(3), 259–264 (2002)
53.
Zurück zum Zitat Chang, K.C.; Yeh, F.: Grey relational analysis based approach for data clustering. IEE Proc. Vis. Image Signal Process. 152(2), 165–172 (2005) Chang, K.C.; Yeh, F.: Grey relational analysis based approach for data clustering. IEE Proc. Vis. Image Signal Process. 152(2), 165–172 (2005)
54.
Zurück zum Zitat Pakkar, M.S.: An integrated approach to grey relational analysis, analytic hierarchy process and data envelopment analysis. J. Cent. Cathedra Bus. Econ. Res. J. 9(1), 71–86 (2016) Pakkar, M.S.: An integrated approach to grey relational analysis, analytic hierarchy process and data envelopment analysis. J. Cent. Cathedra Bus. Econ. Res. J. 9(1), 71–86 (2016)
55.
Zurück zum Zitat Wu, W.H.; Lin, C.T.; Peng, K.H.; Huang, C.C.: Applying hierarchical grey relation clustering analysis to geographical information systems—a case study of the hospitals in Taipei city. Expert Syst. Appl. 39, 7247–7254 (2012) Wu, W.H.; Lin, C.T.; Peng, K.H.; Huang, C.C.: Applying hierarchical grey relation clustering analysis to geographical information systems—a case study of the hospitals in Taipei city. Expert Syst. Appl. 39, 7247–7254 (2012)
56.
57.
Zurück zum Zitat Liu, S.; Lin, Y.: Grey Information Theory and Practical Applications. Springer, New York (2006) Liu, S.; Lin, Y.: Grey Information Theory and Practical Applications. Springer, New York (2006)
58.
Zurück zum Zitat Liu, S.; Forrest, J.; Yang, Y.: A brief introduction to grey systems theory. Grey Syst. Theory Appl. 2(2), 89–104 (2012) Liu, S.; Forrest, J.; Yang, Y.: A brief introduction to grey systems theory. Grey Syst. Theory Appl. 2(2), 89–104 (2012)
59.
Zurück zum Zitat Yıldırım, B.F.: Gri ilişkisel analiz. In: Yıldırım, B.F., Önder, E. (eds.) Çok Kriterli Karar Verme Yöntemleri, pp. 229–236. Dora Basım Yayın, Bursa, Turkey (2015) Yıldırım, B.F.: Gri ilişkisel analiz. In: Yıldırım, B.F., Önder, E. (eds.) Çok Kriterli Karar Verme Yöntemleri, pp. 229–236. Dora Basım Yayın, Bursa, Turkey (2015)
60.
Zurück zum Zitat Jin, X.: Grey relational clustering method and its application. J. Grey Syst. 3, 181–188 (1993)MATH Jin, X.: Grey relational clustering method and its application. J. Grey Syst. 3, 181–188 (1993)MATH
62.
Zurück zum Zitat Ertugrul, I.; Oztas, T.; Ozcil, A.; Oztas, G. Z.: Grey relational analysis approach in academic performance comparison of university: a case study of Turkish universities. Eur. Sci. J. June 2016 Special edition, pp. 128–139 (2016) Ertugrul, I.; Oztas, T.; Ozcil, A.; Oztas, G. Z.: Grey relational analysis approach in academic performance comparison of university: a case study of Turkish universities. Eur. Sci. J. June 2016 Special edition, pp. 128–139 (2016)
63.
Zurück zum Zitat Wilbur, W.J.; Sirotkin, K.: The automatic identification of stop words. J. Inf. Sci. 18, 45–55 (1992) Wilbur, W.J.; Sirotkin, K.: The automatic identification of stop words. J. Inf. Sci. 18, 45–55 (1992)
65.
Zurück zum Zitat Jivani, A.G.: A comparative study of stemming algorithms. Int. J. Comput. Technol. Appl. 2(6), 1930–1938 (2011) Jivani, A.G.: A comparative study of stemming algorithms. Int. J. Comput. Technol. Appl. 2(6), 1930–1938 (2011)
66.
Zurück zum Zitat Powers, D.W.M.: Evaluation: from precision, recall and f-measure to ROC, informedness, markedness and correlation. J. Mach. Learn. Technol. 2(1), 37–63 (2011)MathSciNet Powers, D.W.M.: Evaluation: from precision, recall and f-measure to ROC, informedness, markedness and correlation. J. Mach. Learn. Technol. 2(1), 37–63 (2011)MathSciNet
67.
Zurück zum Zitat Rosenberg, A.; Hirschberg, J.: V-measure: a conditional entropy-based external cluster evaluation measure. In: Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Prague, pp. 410–420 (2007) Rosenberg, A.; Hirschberg, J.: V-measure: a conditional entropy-based external cluster evaluation measure. In: Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Prague, pp. 410–420 (2007)
68.
Zurück zum Zitat Nizam, H.; Akın, S.S.: Sosyal medyada makine öğrenmesi ile duygu analizinde dengeli ve dengesiz veri setlerinin performanslarının karşılaştırılması. In: XIX. Türkiye’de İnternet Konferansı, İzmir, pp. 129–136 (2014) Nizam, H.; Akın, S.S.: Sosyal medyada makine öğrenmesi ile duygu analizinde dengeli ve dengesiz veri setlerinin performanslarının karşılaştırılması. In: XIX. Türkiye’de İnternet Konferansı, İzmir, pp. 129–136 (2014)
69.
Zurück zum Zitat Chormunge, S.; Jena, S.: Efficiency and effectiveness of clustering algorithms for high dimensional data. Int. J. Comput. Appl. 125(11), 35–40 (2015) Chormunge, S.; Jena, S.: Efficiency and effectiveness of clustering algorithms for high dimensional data. Int. J. Comput. Appl. 125(11), 35–40 (2015)
70.
Zurück zum Zitat Flach, P.; Kull, M.: Precision-recall-gain curves: PR analysis done right. Adv. Neural. Inf. Process. Syst. 28, 838–846 (2015) Flach, P.; Kull, M.: Precision-recall-gain curves: PR analysis done right. Adv. Neural. Inf. Process. Syst. 28, 838–846 (2015)
71.
Zurück zum Zitat Maratea, A.; Petrosino, A.; Manzo, M.: Adjusted F-measure and kernel scaling for imbalanced data learning. Inf. Sci. 257, 331–341 (2014) Maratea, A.; Petrosino, A.; Manzo, M.: Adjusted F-measure and kernel scaling for imbalanced data learning. Inf. Sci. 257, 331–341 (2014)
73.
Zurück zum Zitat Liu, Y.; Cheng, J.; Yan, C.; Wu, X.; Chen, F.: Research on Matthews correlation coefficients metrics of personalized recommendation algorithm evaluation. Int. J. Hybrid Inf. Technol. 8(1), 163–172 (2015) Liu, Y.; Cheng, J.; Yan, C.; Wu, X.; Chen, F.: Research on Matthews correlation coefficients metrics of personalized recommendation algorithm evaluation. Int. J. Hybrid Inf. Technol. 8(1), 163–172 (2015)
Metadaten
Titel
A Novel Short Text Clustering Model Based on Grey System Theory
verfasst von
Hüseyin Fidan
Mehmet Erkan Yuksel
Publikationsdatum
15.10.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Arabian Journal for Science and Engineering / Ausgabe 4/2020
Print ISSN: 2193-567X
Elektronische ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-019-04191-0

Weitere Artikel der Ausgabe 4/2020

Arabian Journal for Science and Engineering 4/2020 Zur Ausgabe

Research Article - Special Issue - Intelligent Computing and Interdisciplinary Applications

Empirical Evaluation of Automated Test Suite Generation and Optimization

Research Article - Special Issue - Intelligent Computing And Interdisciplinary Applications

An Integrated Word Embedding-Based Dual-Task Learning Method for Sentiment Analysis

Research Article - SPECIAL ISSUE - INTELLIGENT COMPUTING and INTERDISCIPLINARY APPLICATIONS

A Congestion Aware Route Suggestion Protocol for Traffic Management in Internet of Vehicles

Research Article - Computer Engineering and Computer Science

An Efficient Language-Independent Acoustic Emotion Classification System

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.