2 November 2019 | Methodologies and Application

Automating fake news detection system using multi-level voting model

Authors: Sawinder Kaur, Parteek Kumar, Ponnurangam Kumaraguru

Published in: Soft Computing | Issue 12/2020

Abstract

Online fake news has become increasingly prominent in shaping how news stories spread online. Misleading or unreliable information in the form of videos, posts, articles and URLs is widely disseminated through popular social media platforms such as Facebook and Twitter. As a result, editors and journalists need new tools that help them speed up the verification of content originating from social media. Motivated by the need for automated detection of fake news, this paper investigates which classification model identifies fake features most accurately under three feature extraction techniques: Term Frequency-Inverse Document Frequency (TF-IDF), Count-Vectorizer (CV) and Hashing-Vectorizer (HV). It also proposes a novel multi-level voting ensemble model. The proposed system has been tested on three datasets using twelve classifiers, which are combined based on their false prediction ratio. Individually, the Passive Aggressive, Logistic Regression and Linear Support Vector Classifier (LinearSVC) models perform best with the TF-IDF, CV and HV feature extraction approaches, respectively, based on their performance metrics. The proposed model outperforms the Passive Aggressive model by 0.8%, the Logistic Regression model by 1.3% and the LinearSVC model by 0.4% using TF-IDF, CV and HV, respectively. The proposed system can also be used to detect fake textual content on online social media websites.
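To make the ideas in the abstract concrete, the following is a minimal, standard-library sketch of the three feature representations it names (count, TF-IDF and hashing vectors) and of a two-level majority vote. It is not the authors' implementation: the tokenizer, the smoothed IDF formula, the bucket count, the rule of grouping classifiers in threes by ascending error ratio, and all names and numbers in the example (`multi_level_vote`, `error_ratio`, classifiers `a` to `i`) are assumptions chosen for illustration only.

```python
import math
import zlib
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (a simplification for this sketch)."""
    return text.lower().split()


def count_vectors(docs, vocab):
    """Count-Vectorizer idea: raw term counts per document."""
    rows = []
    for d in docs:
        counts = Counter(tokenize(d))
        rows.append([counts[t] for t in vocab])
    return rows


def tfidf_vectors(docs, vocab):
    """TF-IDF idea: term counts scaled by a smoothed inverse document frequency."""
    n = len(docs)
    token_sets = [set(tokenize(d)) for d in docs]
    idf = [math.log((1 + n) / (1 + sum(t in s for s in token_sets))) + 1
           for t in vocab]
    return [[c * w for c, w in zip(row, idf)] for row in count_vectors(docs, vocab)]


def hashing_vectors(docs, n_buckets=16):
    """Hashing-Vectorizer idea: no vocabulary, tokens are hashed into fixed buckets."""
    rows = []
    for d in docs:
        row = [0] * n_buckets
        for tok in tokenize(d):
            row[zlib.crc32(tok.encode("utf-8")) % n_buckets] += 1
        rows.append(row)
    return rows


def majority_vote(labels):
    """Return the most frequent label (ties go to the label seen first)."""
    return Counter(labels).most_common(1)[0][0]


def multi_level_vote(error_ratio, predictions, group_size=3):
    """Hypothetical scheme: rank classifiers by error ratio, vote within
    groups of `group_size`, then vote across the group winners."""
    ranked = sorted(predictions, key=lambda name: error_ratio[name])
    groups = [ranked[i:i + group_size] for i in range(0, len(ranked), group_size)]
    level_one = [majority_vote([predictions[name] for name in g]) for g in groups]
    return majority_vote(level_one)


# Toy data, invented for illustration only.
docs = ["shocking miracle cure found",
        "council approves new budget",
        "miracle cure shocking doctors"]
vocab = sorted({t for d in docs for t in tokenize(d)})
counts = count_vectors(docs, vocab)
tfidf = tfidf_vectors(docs, vocab)
hashed = hashing_vectors(docs)

# Made-up error ratios and predictions for nine hypothetical classifiers.
error_ratio = {"a": 0.05, "b": 0.06, "c": 0.07, "d": 0.10, "e": 0.11,
               "f": 0.12, "g": 0.20, "h": 0.21, "i": 0.22}
predictions = {"a": "fake", "b": "fake", "c": "real", "d": "real", "e": "fake",
               "f": "fake", "g": "real", "h": "real", "i": "real"}
flat_label = majority_vote(list(predictions.values()))
multi_label = multi_level_vote(error_ratio, predictions)
```

In this toy setup the flat vote and the two-level vote disagree, which illustrates that how classifiers are grouped can change the ensemble's decision. In practice one would likely use library implementations such as scikit-learn's `CountVectorizer`, `TfidfVectorizer` and `HashingVectorizer` rather than these hand-rolled functions.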

Title: Automating fake news detection system using multi-level voting model
Authors: Sawinder Kaur, Parteek Kumar, Ponnurangam Kumaraguru
Publication date: 2 November 2019
Publisher: Springer Berlin Heidelberg
Published in: Soft Computing / Issue 12/2020
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI: https://doi.org/10.1007/s00500-019-04436-y
