Published in: Journal of Intelligent Information Systems, Issue 2/2018

29.03.2017

State-of-art approaches for review spammer detection: a survey

Authors: Rupesh Kumar Dewang, Anil Kumar Singh

Abstract

E-commerce websites have become a favourite way to shop comfortably from home, without the burden of going to a market. Their success depends on reviews written by consumers who have used particular products and then shared their experiences. These reviews also affect customers' buying decisions, and for this reason the posting of fake reviews is increasing. Competitors of a product's brand, or the company itself, may post fraudulent reviews to gain more profit. Such fraudulent reviews are spam reviews, which distort the decisions of prospective consumers; many customers are misled by them. A person who writes fake reviews is called a spammer. Identifying spammers indirectly helps to determine whether reviews are spam. The detection of review spammers is therefore a serious concern for e-commerce businesses. To help researchers in this vibrant area, we present the state-of-the-art approaches for review spammer detection. This paper presents a comprehensive survey of existing spammer detection approaches, describing the features used for individual and group spammer detection and summarizing the datasets with details of their reviews, products and reviewers. The main aim of this paper is to provide a basic, comprehensive and comparative study of current research on detecting review spammers using machine learning techniques and to give future directions. The paper also provides a concise summary of published research to help potential researchers in this area develop new techniques.

Metadata
Title
State-of-art approaches for review spammer detection: a survey
Authors
Rupesh Kumar Dewang
Anil Kumar Singh
Publication date
29.03.2017
Publisher
Springer US
Published in
Journal of Intelligent Information Systems / Issue 2/2018
Print ISSN: 0925-9902
Electronic ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-017-0454-7
