
2018 | Original Paper | Book Chapter

Interactions Between Term Weighting and Feature Selection Methods on the Sentiment Analysis of Turkish Reviews

Authors: Tuba Parlar, Selma Ayşe Özel, Fei Song

Published in: Computational Linguistics and Intelligent Text Processing

Publisher: Springer International Publishing


Abstract

Term weighting methods assign weights to the terms in a document so that more important terms receive higher weights in the text representation. In this study, we consider four term weighting methods and three feature selection methods, and investigate how the term weighting methods respond to the reduced text representation. We conduct experiments on five Turkish review datasets to establish baselines and compare the performance of the term weighting methods. We also test the methods on English reviews to identify how the results differ from those for the Turkish reviews. We show that the tf and tp weighting methods perform best for the Turkish reviews, while tp performs best for the English reviews. When feature selection is applied, the tf * idf method with DFD and χ² achieves the highest accuracies for the Turkish reviews, while the tf * idf and tp methods with χ² perform best for the English reviews.
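As a rough illustration of the ideas named in the abstract, the following sketch computes three of the weighting schemes (tf, tp, and tf * idf) and ranks terms by the χ² statistic on a toy corpus. It uses common textbook definitions, which are an assumption here and may differ from the exact variants used in the paper; the toy reviews, the function names, and the `log(N / df)` form of idf are all hypothetical choices made for this example. DFD and the fourth weighting method are not shown because the abstract does not define them.

```python
import math
from collections import Counter

# Toy tokenized reviews with polarity labels (1 = positive, 0 = negative).
# Purely illustrative data, not taken from the paper's datasets.
docs = [
    (["good", "good", "film"], 1),
    (["great", "film"], 1),
    (["bad", "film"], 0),
    (["bad", "bad", "plot"], 0),
]

def tf(tokens):
    """Term frequency: raw count of each term in one document."""
    return dict(Counter(tokens))

def tp(tokens):
    """Term presence: 1 for every term that occurs in the document."""
    return {t: 1 for t in set(tokens)}

def idf(corpus):
    """Inverse document frequency as log(N / df); one common variant (assumed)."""
    n = len(corpus)
    df = Counter(t for tokens, _ in corpus for t in set(tokens))
    return {t: math.log(n / d) for t, d in df.items()}

def tf_idf(tokens, idf_scores):
    """tf * idf weight of each term in one document."""
    return {t: c * idf_scores[t] for t, c in Counter(tokens).items()}

def chi2(term, corpus):
    """Chi-square statistic of the 2x2 term/class contingency table."""
    a = sum(1 for toks, y in corpus if term in toks and y == 1)
    b = sum(1 for toks, y in corpus if term in toks and y == 0)
    c = sum(1 for toks, y in corpus if term not in toks and y == 1)
    d = sum(1 for toks, y in corpus if term not in toks and y == 0)
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * (a * d - b * c) ** 2 / denom if denom else 0.0

def select_top_k(corpus, k):
    """Keep the k terms with the highest chi-square scores."""
    vocab = sorted({t for toks, _ in corpus for t in toks})
    return sorted(vocab, key=lambda t: chi2(t, corpus), reverse=True)[:k]

if __name__ == "__main__":
    idf_scores = idf(docs)
    first_review = docs[0][0]
    print("tf:    ", tf(first_review))
    print("tp:    ", tp(first_review))
    print("tf*idf:", tf_idf(first_review, idf_scores))
    print("top term by chi-square:", select_top_k(docs, 1))
```

In a reduced representation, each document would be weighted only over the terms returned by `select_top_k`, which is the setting in which the abstract compares the weighting methods.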


Metadata
Title
Interactions Between Term Weighting and Feature Selection Methods on the Sentiment Analysis of Turkish Reviews
Authors
Tuba Parlar
Selma Ayşe Özel
Fei Song
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-75487-1_26
