Skip to main content

2021 | OriginalPaper | Buchkapitel

Fake Review Classification Using Supervised Machine Learning

verfasst von : Hanif Khan, Muhammad Usama Asghar, Muhammad Zubair Asghar, Gautam Srivastava, Praveen Kumar Reddy Maddikunta, Thippa Reddy Gadekallu

Erschienen in: Pattern Recognition. ICPR International Workshops and Challenges

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The revolution of social media has propelled the online community to take advantage of online reviews for not only posting feedback about the products, services, and other issues but also assists individuals to analyze user’s feedback for making purchase decisions, and companies for improving the quality of manufactured goods. However, the propagation of fake reviews has become an alarming issue, as it deceives online users while purchasing and promotes or demotes the reputation of competing brands. In this work, we propose a supervised learning-based technique for the detection of fake reviews from the online textual content. The study employs machine learning classifiers for bifurcating fake and genuine reviews. Experimental results are evaluated against different evaluation measures and the performance of the proposed system is compared with baseline works.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
3.
Zurück zum Zitat Pragna, B., RamaBa, M.: Spam detection using NLP techniques. Int. J. Recent Technol. Eng. (IJRTE) 8(2S11), 2423–2426 (2019). ISSN 2277-3878 Pragna, B., RamaBa, M.: Spam detection using NLP techniques. Int. J. Recent Technol. Eng. (IJRTE) 8(2S11), 2423–2426 (2019). ISSN 2277-3878
4.
Zurück zum Zitat Renuka, D.K., Hamsapriya, T., Chakkaravarthi, M.R., Surya, P.L.: Spam classification based on supervised learning using machine learning techniques. In: 2011 International Conference on Process Automation, Control and Computing, pp. 1–7. IEEE, July 2011 Renuka, D.K., Hamsapriya, T., Chakkaravarthi, M.R., Surya, P.L.: Spam classification based on supervised learning using machine learning techniques. In: 2011 International Conference on Process Automation, Control and Computing, pp. 1–7. IEEE, July 2011
6.
Zurück zum Zitat Ghai, R., Kumar, S., Pandey, A.C.: Spam detection using rating and review processing method. In: Panigrahi, B.K., Trivedi, M.C., Mishra, K.K., Tiwari, S., Singh, P.K. (eds.) Smart Innovations in Communication and Computational Sciences. AISC, vol. 670, pp. 189–198. Springer, Singapore (2019). https://doi.org/10.1007/978-981-10-8971-8_18CrossRef Ghai, R., Kumar, S., Pandey, A.C.: Spam detection using rating and review processing method. In: Panigrahi, B.K., Trivedi, M.C., Mishra, K.K., Tiwari, S., Singh, P.K. (eds.) Smart Innovations in Communication and Computational Sciences. AISC, vol. 670, pp. 189–198. Springer, Singapore (2019). https://​doi.​org/​10.​1007/​978-981-10-8971-8_​18CrossRef
7.
Zurück zum Zitat Narayan, R., Rout, J.K., Jena, S.K.: Review spam detection using opinion mining. In: Sa, P.K., Sahoo, M.N., Murugappan, M., Wu, Y., Majhi, B. (eds.) Progress in Intelligent Computing Techniques: Theory, Practice, and Applications. AISC, vol. 519, pp. 273–279. Springer, Singapore (2018). https://doi.org/10.1007/978-981-10-3376-6_30CrossRef Narayan, R., Rout, J.K., Jena, S.K.: Review spam detection using opinion mining. In: Sa, P.K., Sahoo, M.N., Murugappan, M., Wu, Y., Majhi, B. (eds.) Progress in Intelligent Computing Techniques: Theory, Practice, and Applications. AISC, vol. 519, pp. 273–279. Springer, Singapore (2018). https://​doi.​org/​10.​1007/​978-981-10-3376-6_​30CrossRef
8.
Zurück zum Zitat You, L., Peng, Q., Xiong, Z., He, D., Qiu, M., Zhang, X.: Integrating aspect analysis and local outlier factor for intelligent review spam detection. Future Gener. Comput. Syst. 102, 163–172 (2020)CrossRef You, L., Peng, Q., Xiong, Z., He, D., Qiu, M., Zhang, X.: Integrating aspect analysis and local outlier factor for intelligent review spam detection. Future Gener. Comput. Syst. 102, 163–172 (2020)CrossRef
9.
Zurück zum Zitat Mataoui, M.H., Zelmati, O., Boughaci, D., Chaouche, M., Lagoug, F.: A proposed spam detection approach for Arabic social networks content. In: 2017 International Conference on Mathematics and Information Technology (ICMIT), pp. 222–226. IEEE, December 2017 Mataoui, M.H., Zelmati, O., Boughaci, D., Chaouche, M., Lagoug, F.: A proposed spam detection approach for Arabic social networks content. In: 2017 International Conference on Mathematics and Information Technology (ICMIT), pp. 222–226. IEEE, December 2017
10.
Zurück zum Zitat Li, L., Qin, B., Ren, W., Liu, T.: Document representation and feature combination for deceptive spam review detection. Neurocomputing 254, 33–41 (2017)CrossRef Li, L., Qin, B., Ren, W., Liu, T.: Document representation and feature combination for deceptive spam review detection. Neurocomputing 254, 33–41 (2017)CrossRef
12.
Zurück zum Zitat Kashti, M.R.P., Prasad, P.S.: Enhancing NLP techniques for fake review detection. Int. Res. J. Eng. Technol. (IRJET) 6, 241–245 (2019) Kashti, M.R.P., Prasad, P.S.: Enhancing NLP techniques for fake review detection. Int. Res. J. Eng. Technol. (IRJET) 6, 241–245 (2019)
13.
Zurück zum Zitat Kokate, S., Tidke, B.: Fake review and brand spam detection using J48 classifier. IJCSIT Int. J. Comput. Sci. Inf. Technol. 6(4), 3523–3526 (2015) Kokate, S., Tidke, B.: Fake review and brand spam detection using J48 classifier. IJCSIT Int. J. Comput. Sci. Inf. Technol. 6(4), 3523–3526 (2015)
14.
Zurück zum Zitat Noekhah, S., Fouladfar, E., Salim, N., Ghorashi, S.H., Hozhabri, A.A.: A novel approach for opinion spam detection in e-commerce. In: Proceedings of the 8th IEEE International Conference on E-Commerce with Focus on E-Trust (2014) Noekhah, S., Fouladfar, E., Salim, N., Ghorashi, S.H., Hozhabri, A.A.: A novel approach for opinion spam detection in e-commerce. In: Proceedings of the 8th IEEE International Conference on E-Commerce with Focus on E-Trust (2014)
15.
Zurück zum Zitat Asghar, M.Z., Khan, A., Ahmad, S., Khan, I.A., Kundi, F.M.: A unified framework for creating domain dependent polarity lexicons from user generated reviews. PLoS ONE 10(10), e0140204 (2015)CrossRef Asghar, M.Z., Khan, A., Ahmad, S., Khan, I.A., Kundi, F.M.: A unified framework for creating domain dependent polarity lexicons from user generated reviews. PLoS ONE 10(10), e0140204 (2015)CrossRef
16.
Zurück zum Zitat Sun, C., Du, Q., Tian, G.: Exploiting product related review features for fake review detection. Math. Probl. Eng. 2016, 1–7 (2016) Sun, C., Du, Q., Tian, G.: Exploiting product related review features for fake review detection. Math. Probl. Eng. 2016, 1–7 (2016)
18.
Zurück zum Zitat Kiwanuka, F.N., Alqatawna, J.F., Amin, A.H.M., Paul, S., Faris, H.: Towards automated comprehensive feature engineering for spam detection (2019) Kiwanuka, F.N., Alqatawna, J.F., Amin, A.H.M., Paul, S., Faris, H.: Towards automated comprehensive feature engineering for spam detection (2019)
19.
Zurück zum Zitat Mukherjee, A., Venkataraman, V., Liu, B., Glance, N.: What yelp fake review filter might be doing? In: Seventh International AAAI Conference on Weblogs and Social Media, June 2013 Mukherjee, A., Venkataraman, V., Liu, B., Glance, N.: What yelp fake review filter might be doing? In: Seventh International AAAI Conference on Weblogs and Social Media, June 2013
20.
Zurück zum Zitat Algur, S.P., Biradar, J.G.: Rating consistency and review content based multiple stores review spam detection. In: 2015 International Conference on Information Processing (ICIP), pp. 685–690. IEEE, December 2015 Algur, S.P., Biradar, J.G.: Rating consistency and review content based multiple stores review spam detection. In: 2015 International Conference on Information Processing (ICIP), pp. 685–690. IEEE, December 2015
21.
Zurück zum Zitat Li, J., Ott, M., Cardie, C., Hovy, E.: Towards a general rule for identifying deceptive opinion spam. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1566–1576, June 2014 Li, J., Ott, M., Cardie, C., Hovy, E.: Towards a general rule for identifying deceptive opinion spam. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1566–1576, June 2014
23.
Zurück zum Zitat Prajapati, J., Bhatt, M., Prajapati, D.J.: Detection and summarization of genuine review using visual data mining. Int. J. Comput. Appl. 975, 8887 (2012) Prajapati, J., Bhatt, M., Prajapati, D.J.: Detection and summarization of genuine review using visual data mining. Int. J. Comput. Appl. 975, 8887 (2012)
24.
Zurück zum Zitat Fusilier, D.H., Cabrera, R.G., Montes, M., Rosso, P.: Using PU-learning to detect deceptive opinion spam. In: Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 38–45, June 2013 Fusilier, D.H., Cabrera, R.G., Montes, M., Rosso, P.: Using PU-learning to detect deceptive opinion spam. In: Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 38–45, June 2013
25.
Zurück zum Zitat Radulescu, C., Dinsoreanu, M., Potolea, R.: Identification of spam comments using natural language processing techniques. In: 2014 IEEE 10th International Conference on Intelligent Computer Communication and Processing (ICCP), pp. 29–35. IEEE, September 2014 Radulescu, C., Dinsoreanu, M., Potolea, R.: Identification of spam comments using natural language processing techniques. In: 2014 IEEE 10th International Conference on Intelligent Computer Communication and Processing (ICCP), pp. 29–35. IEEE, September 2014
26.
Zurück zum Zitat Reitermanova, Z.: Data splitting. In: WDS, vol. 10, pp. 31–36 (2010) Reitermanova, Z.: Data splitting. In: WDS, vol. 10, pp. 31–36 (2010)
27.
Zurück zum Zitat Nabil, M., Aly, M., Atiya, A.: ASTD: Arabic sentiment tweets dataset. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 2515–2519, September 2015 Nabil, M., Aly, M., Atiya, A.: ASTD: Arabic sentiment tweets dataset. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 2515–2519, September 2015
29.
Zurück zum Zitat Ejaz, A., Turabee, Z., Rahim, M., Khoja, S.: Opinion mining approaches on Amazon product reviews: a comparative study. In: 2017 International Conference on Information and Communication Technologies (ICICT), pp. 173–179. IEEE, December 2017 Ejaz, A., Turabee, Z., Rahim, M., Khoja, S.: Opinion mining approaches on Amazon product reviews: a comparative study. In: 2017 International Conference on Information and Communication Technologies (ICICT), pp. 173–179. IEEE, December 2017
30.
Zurück zum Zitat Khattak, A.M., Ullah, H., Khalid, H.A., Habib, A., Asghar, M.Z., Kundi, F.M.: Stock market trend prediction using supervised learning. In: Proceedings of the Tenth International Symposium on Information and Communication Technology, pp. 85–91, December 2019 Khattak, A.M., Ullah, H., Khalid, H.A., Habib, A., Asghar, M.Z., Kundi, F.M.: Stock market trend prediction using supervised learning. In: Proceedings of the Tenth International Symposium on Information and Communication Technology, pp. 85–91, December 2019
32.
Zurück zum Zitat Effrosynidis, D., Peikos, G., Symeonidis, S., Arampatzis, A.: DUTH at SemEval-2018 task 2: Emoji prediction in tweets. In: Proceedings of the 12th International Workshop on Semantic Evaluation, pp. 466–469, June 2018 Effrosynidis, D., Peikos, G., Symeonidis, S., Arampatzis, A.: DUTH at SemEval-2018 task 2: Emoji prediction in tweets. In: Proceedings of the 12th International Workshop on Semantic Evaluation, pp. 466–469, June 2018
33.
Zurück zum Zitat Nayak, A., Natarajan, D.: Comparative study of Naive Bayes, support vector machine and random forest classifiers in sentiment analysis of Twitter feeds. Int. J. Adv. Stud. Comput. Sci. Eng. 5, 14–17 (2016) Nayak, A., Natarajan, D.: Comparative study of Naive Bayes, support vector machine and random forest classifiers in sentiment analysis of Twitter feeds. Int. J. Adv. Stud. Comput. Sci. Eng. 5, 14–17 (2016)
34.
Zurück zum Zitat Ismail, H., Harous, S., Belkhoucshe, B.: A comparative analysis of machine learning classifiers for Twitter sentiment analysis. Res. Comput. Sci. 110, 71–83 (2016)CrossRef Ismail, H., Harous, S., Belkhoucshe, B.: A comparative analysis of machine learning classifiers for Twitter sentiment analysis. Res. Comput. Sci. 110, 71–83 (2016)CrossRef
35.
Zurück zum Zitat Babajide Mustapha, I., Saeed, F.: Bioactive molecule prediction using extreme gradient boosting. Molecules 21(8), 983 (2016)CrossRef Babajide Mustapha, I., Saeed, F.: Bioactive molecule prediction using extreme gradient boosting. Molecules 21(8), 983 (2016)CrossRef
36.
Zurück zum Zitat Van der Walt, C.M., Barnard, E.: Data characteristics that determine classifier performance (2006) Van der Walt, C.M., Barnard, E.: Data characteristics that determine classifier performance (2006)
37.
Zurück zum Zitat Kwon, O., Sim, J.M.: Effects of data set features on the performances of classification algorithms. Expert Syst. Appl. 40(5), 1847–1857 (2013)CrossRef Kwon, O., Sim, J.M.: Effects of data set features on the performances of classification algorithms. Expert Syst. Appl. 40(5), 1847–1857 (2013)CrossRef
38.
Zurück zum Zitat Reddy, G.T., et al.: Analysis of dimensionality reduction techniques on big data. IEEE Access 8, 54776–54788 (2020)CrossRef Reddy, G.T., et al.: Analysis of dimensionality reduction techniques on big data. IEEE Access 8, 54776–54788 (2020)CrossRef
39.
Zurück zum Zitat Maddikunta, P.K.R., Srivastava, G., Gadekallu, T.R., Deepa, N., Boopathy, P.: Predictive model for battery life in IoT networks. IET Intell. Transp. Syst. 14, 1388–1395 (2020)CrossRef Maddikunta, P.K.R., Srivastava, G., Gadekallu, T.R., Deepa, N., Boopathy, P.: Predictive model for battery life in IoT networks. IET Intell. Transp. Syst. 14, 1388–1395 (2020)CrossRef
40.
Zurück zum Zitat Ch, R., Srivastava, G., Gadekallu, T.R., Maddikunta, P.K.R., Bhattacharya, S.: Security and privacy of UAV data using blockchain technology. J. Inf. Secur. Appl. 55, 102670 (2020) Ch, R., Srivastava, G., Gadekallu, T.R., Maddikunta, P.K.R., Bhattacharya, S.: Security and privacy of UAV data using blockchain technology. J. Inf. Secur. Appl. 55, 102670 (2020)
41.
Zurück zum Zitat Baza, M., Mahmoud, M., Srivastava, G., Alasmary, W., Younis, M.: A light blockchain-powered privacy-preserving organization scheme for ride sharing services. In: 2020 IEEE 91st Vehicular Technology Conference (VTC2020-Spring), pp. 1–6. IEEE, May 2020 Baza, M., Mahmoud, M., Srivastava, G., Alasmary, W., Younis, M.: A light blockchain-powered privacy-preserving organization scheme for ride sharing services. In: 2020 IEEE 91st Vehicular Technology Conference (VTC2020-Spring), pp. 1–6. IEEE, May 2020
42.
Zurück zum Zitat MK, M., Srivastava, G., Somayaji, S.R.K., Gadekallu, T.R., Maddikunta, P.K.R., Bhattacharya, S.: An incentive based approach for COVID-19 using blockchain technology. arXiv preprint arXiv:2011.01468 (2020) MK, M., Srivastava, G., Somayaji, S.R.K., Gadekallu, T.R., Maddikunta, P.K.R., Bhattacharya, S.: An incentive based approach for COVID-19 using blockchain technology. arXiv preprint arXiv:​2011.​01468 (2020)
Metadaten
Titel
Fake Review Classification Using Supervised Machine Learning
verfasst von
Hanif Khan
Muhammad Usama Asghar
Muhammad Zubair Asghar
Gautam Srivastava
Praveen Kumar Reddy Maddikunta
Thippa Reddy Gadekallu
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-68799-1_19

Premium Partner