Skip to main content
Top

2018 | OriginalPaper | Chapter

Classification of Amazon Book Reviews Based on Sentiment Analysis

Authors : K. S. Srujan, S. S. Nikhil, H. Raghav Rao, K. Karthik, B. S. Harish, H. M. Keerthi Kumar

Published in: Information Systems Design and Intelligent Applications

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Since the dawn of internet, e-shopping vendors like Amazon have grown in popularity. Customers express their opinion or sentiment by giving feedbacks in the form of text. Sentiment analysis is the process of determining the opinion or feeling expressed as either positive, negative or neutral. Capturing the exact sentiment of a review is a challenging task. In this paper, the various preprocessing techniques like HTML tags and URLs removal, punctuation, whitespace, special character removal and stemming are used to eliminate noise. The preprocessed data is represented using feature selection techniques like term frequency-inverse document frequency (TF–IDF). The classifiers like K-Nearest Neighbour (KNN), Decision Tree (DT), Support Vector Machine (SVM), Random Forest (RF) and Naive Bayes (NB) are used to classify sentiment of Amazon book reviews. Finally, we present a comparison of (i) Accuracy of various classifiers, (ii) Time elapsed by each classifier and (iii) Sentiment score of various books.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Pang, B., and Lee, L.: A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the 42nd annual meeting on Association for Computational Linguistics, pp. 271 (2004). Pang, B., and Lee, L.: A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the 42nd annual meeting on Association for Computational Linguistics, pp. 271 (2004).
2.
go back to reference Pang, B., and Lee, L.: A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts, in Proceedings of the 42nd annual meeting on Association for Computational Linguistics. Association for Computational Linguistics, pp. 271 (2004). Pang, B., and Lee, L.: A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts, in Proceedings of the 42nd annual meeting on Association for Computational Linguistics. Association for Computational Linguistics, pp. 271 (2004).
3.
go back to reference Tripathy, Abinash, Agarwal,A. and Santanu Kumar Rath.: Classification of Sentimental Reviews Using Machine Learning Techniques. Procedia Computer Science, Vol. 57, pp. 821–829 (2015). Tripathy, Abinash, Agarwal,A. and Santanu Kumar Rath.: Classification of Sentimental Reviews Using Machine Learning Techniques. Procedia Computer Science, Vol. 57, pp. 821–829 (2015).
4.
go back to reference Smithikrai, C.: Effectiveness of teaching with movies to promote positive characteristics and behaviors. Procedia-Social and Behavioral Sciences, Vol. 217, pp. 522–530 (2016). Smithikrai, C.: Effectiveness of teaching with movies to promote positive characteristics and behaviors. Procedia-Social and Behavioral Sciences, Vol. 217, pp. 522–530 (2016).
5.
go back to reference Shruti, T., and Choudhary, M.: Feature Based Opinion Mining on Movie Review. International Journal of Advanced Engineering Research and Science, Vol. 3, pp. 77–81. (2016). Shruti, T., and Choudhary, M.: Feature Based Opinion Mining on Movie Review. International Journal of Advanced Engineering Research and Science, Vol. 3, pp. 77–81. (2016).
6.
go back to reference Hu, M., and Liu, B.: Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining ACM, pp. 168–177 (2004). Hu, M., and Liu, B.: Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining ACM, pp. 168–177 (2004).
7.
go back to reference Pang, B., and Lee, L.: Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, Vol. 2, No. 1–2, pp. 1–135 (2008). Pang, B., and Lee, L.: Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, Vol. 2, No. 1–2, pp. 1–135 (2008).
8.
go back to reference Chesley, P., Vincent, B., Xu, L., and Srihari, R. K.: Using verbs and adjectives to automatically classify blog sentiment Training, Vol. 580, No. 263, pp. 233 (2006). Chesley, P., Vincent, B., Xu, L., and Srihari, R. K.: Using verbs and adjectives to automatically classify blog sentiment Training, Vol. 580, No. 263, pp. 233 (2006).
9.
go back to reference Choi, Y., and Cardie, C.: Adapting a polarity lexicon using integer linear programming for domain-specific sentiment classification. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Vol. 2, pp. 590–598 (2009). Choi, Y., and Cardie, C.: Adapting a polarity lexicon using integer linear programming for domain-specific sentiment classification. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Vol. 2, pp. 590–598 (2009).
10.
go back to reference Tan, L. K. W., Na, J. C., Theng, Y. L., and Chang, K.: Sentence-level sentiment polarity classification using a linguistic approach. In International Conference on Asian Digital Libraries Springer Berlin Heidelberg, pp. 77–87 (2011). Tan, L. K. W., Na, J. C., Theng, Y. L., and Chang, K.: Sentence-level sentiment polarity classification using a linguistic approach. In International Conference on Asian Digital Libraries Springer Berlin Heidelberg, pp. 77–87 (2011).
11.
go back to reference Liu, B.: Sentiment analysis and opinion mining. Synthesis lectures on human language technologies, Vol. 5, No. 1, pp. 1–167 (2012). Liu, B.: Sentiment analysis and opinion mining. Synthesis lectures on human language technologies, Vol. 5, No. 1, pp. 1–167 (2012).
12.
go back to reference Mohammad, S. M., and Turney, P. D.: Crowdsourcing a wordemotion association lexicon. Computational Intelligence, Vol. 29, No. 3, pp. 436–465 (2013). Mohammad, S. M., and Turney, P. D.: Crowdsourcing a wordemotion association lexicon. Computational Intelligence, Vol. 29, No. 3, pp. 436–465 (2013).
13.
go back to reference Turney P. D.: Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, pp. 417–424 (2002). Turney P. D.: Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, pp. 417–424 (2002).
14.
go back to reference Bhatt, Aashutosh, Chheda,H., Gawande,K.: Amazon Review Classification and Sentiment Analysis. International Journal of Computer Science and Information Technologies, Vol. 6, No. 6, pp. 5107–5110 (2015). Bhatt, Aashutosh, Chheda,H., Gawande,K.: Amazon Review Classification and Sentiment Analysis. International Journal of Computer Science and Information Technologies, Vol. 6, No. 6, pp. 5107–5110 (2015).
15.
go back to reference Haddi, Emma, Liu,X. and Yong Shi.: The role of text pre-processing in sentiment analysis. Procedia Computer Science, Vol. 17, pp. 26–32 (2013). Haddi, Emma, Liu,X. and Yong Shi.: The role of text pre-processing in sentiment analysis. Procedia Computer Science, Vol. 17, pp. 26–32 (2013).
16.
go back to reference Anand, D., and Naorem, D.: Semi-supervised Aspect Based Sentiment Analysis for Movies Using Review Filtering. Procedia Computer Science, Vol. 84, pp. 86–93 (2016). Anand, D., and Naorem, D.: Semi-supervised Aspect Based Sentiment Analysis for Movies Using Review Filtering. Procedia Computer Science, Vol. 84, pp. 86–93 (2016).
17.
go back to reference Mohammad, S. M., and Turney, P. D.: Emotions evoked by common words and phrases: Using Mechanical Turk to create an emotion lexicon. In Proceedings of the NAACL HLT 2010 workshop on computational approaches to analysis and generation of emotion in text Association for Computational Linguistics, pp. 26–34 (2010). Mohammad, S. M., and Turney, P. D.: Emotions evoked by common words and phrases: Using Mechanical Turk to create an emotion lexicon. In Proceedings of the NAACL HLT 2010 workshop on computational approaches to analysis and generation of emotion in text Association for Computational Linguistics, pp. 26–34 (2010).
20.
go back to reference Vijayarani, S., Ilamathi, M. J., and Nithya, M.: Preprocessing Techniques for Text Mining-An Overview. International Journal of Computer Science and Communication Networks, Vol. 5, No. 1, pp. 7–16 (2015). Vijayarani, S., Ilamathi, M. J., and Nithya, M.: Preprocessing Techniques for Text Mining-An Overview. International Journal of Computer Science and Communication Networks, Vol. 5, No. 1, pp. 7–16 (2015).
21.
go back to reference Trstenjak, B., Mikac, S., and Donko, D.: KNN with TF-IDF based Framework for Text Categorization. Procedia Engineering, Vol. 69, pp. 1356–1364 (2014). Trstenjak, B., Mikac, S., and Donko, D.: KNN with TF-IDF based Framework for Text Categorization. Procedia Engineering, Vol. 69, pp. 1356–1364 (2014).
22.
go back to reference McCallum A., Nigam.K .:A comparison of event models for naive bayes text classification, in AAAI-98 workshop on learning for text categorization, Citeseer, Vol. 752, pp. 41–48 (1998). McCallum A., Nigam.K .:A comparison of event models for naive bayes text classification, in AAAI-98 workshop on learning for text categorization, Citeseer, Vol. 752, pp. 41–48 (1998).
23.
go back to reference Ho T.K.: Random decision forests. In Document Analysis and Recognition, Proceedings of the Third International Conference on IEEE, Vol. 1, pp. 278–282 (1995). Ho T.K.: Random decision forests. In Document Analysis and Recognition, Proceedings of the Third International Conference on IEEE, Vol. 1, pp. 278–282 (1995).
Metadata
Title
Classification of Amazon Book Reviews Based on Sentiment Analysis
Authors
K. S. Srujan
S. S. Nikhil
H. Raghav Rao
K. Karthik
B. S. Harish
H. M. Keerthi Kumar
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-7512-4_40

Premium Partner