Skip to main content

2020 | OriginalPaper | Buchkapitel

A Novel Approach for Selecting Hybrid Features from Online News Textual Metadata for Fake News Detection

verfasst von : Mohamed K. Elhadad, Kin Fun Li, Fayez Gebali

Erschienen in: Advances on P2P, Parallel, Grid, Cloud and Internet Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Nowadays, online news platforms have become the main sources of news for many users. Hence, an urgent need arises to find a way to classify this news automatically and measure its validity to avoid spreading fake news. In this paper, we tried to simulate how humans, in real life, are dealing with news documents. We introduced a new way in which we can deal with the whole textual content of the news documents by extracting a number of characteristics of those texts and extracting a complex set of other metadata related features without segmenting the news documents into parts (title, content, date, source, etc.). Performances of nine machine learning algorithms in terms of Accuracies, Precision, Recall and F1-score are compared when using three different datasets obtaining much better result than the results in [1] and [2].

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Khan, J.Y., Khondaker, M., Islam, T., Iqbal, A., Afroz, S.: A benchmark study on machine learning methods for fake news detection. arXiv preprint arXiv:1905.04749 (2019) Khan, J.Y., Khondaker, M., Islam, T., Iqbal, A., Afroz, S.: A benchmark study on machine learning methods for fake news detection. arXiv preprint arXiv:​1905.​04749 (2019)
2.
Zurück zum Zitat Ahmed, H., Traore, I., Saad, S.: Detection of online fake news using N-gram analysis and machine learning techniques. In: International Conference on Intelligent, Secure, and Dependable Systems in Distributed and Cloud Environments, vol. 10618, pp. 127–138. Springer, Cham (2017) Ahmed, H., Traore, I., Saad, S.: Detection of online fake news using N-gram analysis and machine learning techniques. In: International Conference on Intelligent, Secure, and Dependable Systems in Distributed and Cloud Environments, vol. 10618, pp. 127–138. Springer, Cham (2017)
3.
Zurück zum Zitat Elhadad, M.K., Li, K.F., Gebali, F.: Fake news detection on social media: a systematic survey. In: 2019 IEEE Pacific Rim Conference on Communications, Computers, and Signal Processing, Victoria, B.C., Canada. IEEE (2019) Elhadad, M.K., Li, K.F., Gebali, F.: Fake news detection on social media: a systematic survey. In: 2019 IEEE Pacific Rim Conference on Communications, Computers, and Signal Processing, Victoria, B.C., Canada. IEEE (2019)
4.
Zurück zum Zitat Bondielli, A., Marcelloni, F.: A survey on fake news and rumour detection techniques. Inf. Sci. 497, 38–55 (2019)CrossRef Bondielli, A., Marcelloni, F.: A survey on fake news and rumour detection techniques. Inf. Sci. 497, 38–55 (2019)CrossRef
5.
Zurück zum Zitat Ruchansky, N., Seo, S., Liu, Y.: CSI: a hybrid deep model for fake news detection. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 797–806. ACM (2017) Ruchansky, N., Seo, S., Liu, Y.: CSI: a hybrid deep model for fake news detection. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 797–806. ACM (2017)
6.
Zurück zum Zitat Potthast, M., Kiesel, J., Reinartz, K., Bevendorff, J., Stein, B.: A stylometric inquiry into hyperpartisan and fake news. arXiv preprint arXiv:1702.05638 (2017) Potthast, M., Kiesel, J., Reinartz, K., Bevendorff, J., Stein, B.: A stylometric inquiry into hyperpartisan and fake news. arXiv preprint arXiv:​1702.​05638 (2017)
7.
Zurück zum Zitat Khurana, U.: The linguistic features of fake news headlines and statements. Dissertation Master’s thesis, University of Amsterdam (2017) Khurana, U.: The linguistic features of fake news headlines and statements. Dissertation Master’s thesis, University of Amsterdam (2017)
8.
Zurück zum Zitat Ahmed, H., Traore, I., Saad, S.: Detecting opinion spams and fake news using text classification. Secur. Priv. 1(1), 1–15 (2018)CrossRef Ahmed, H., Traore, I., Saad, S.: Detecting opinion spams and fake news using text classification. Secur. Priv. 1(1), 1–15 (2018)CrossRef
9.
Zurück zum Zitat Ott, M., Choi, Y., Cardie, C., Hancock, J.T.: Finding deceptive opinion spam by any stretch of the imagination. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1. Association for Computational Linguistics, pp. 309–319 (2011) Ott, M., Choi, Y., Cardie, C., Hancock, J.T.: Finding deceptive opinion spam by any stretch of the imagination. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1. Association for Computational Linguistics, pp. 309–319 (2011)
10.
Zurück zum Zitat Al Asaad, B., Erascu, M.: A tool for fake news detection. In: 2018 20th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC). IEEE (2018) Al Asaad, B., Erascu, M.: A tool for fake news detection. In: 2018 20th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC). IEEE (2018)
11.
Zurück zum Zitat Bali, A.P.S., Fernandes, M., Choubey, S., Goel, M.: Comparative performance of machine learning algorithms for fake news detection. In: International Conference on Advances in Computing and Data Sciences, pp. 420–430. Springer (2019) Bali, A.P.S., Fernandes, M., Choubey, S., Goel, M.: Comparative performance of machine learning algorithms for fake news detection. In: International Conference on Advances in Computing and Data Sciences, pp. 420–430. Springer (2019)
12.
Zurück zum Zitat Elhadad, M.K., Badran, K.M., Salama, G.I.: Towards ontology-based web text document classification. In: International Conference on Aerospace Sciences & Aviation Technology (2017) Elhadad, M.K., Badran, K.M., Salama, G.I.: Towards ontology-based web text document classification. In: International Conference on Aerospace Sciences & Aviation Technology (2017)
13.
Zurück zum Zitat Zhu, Z., Liang, J., Li, D., Yu, H., Liu, G.: Hot topic detection based on a refined TF-IDF algorithm. IEEE Access 7, 26996–27007 (2019)CrossRef Zhu, Z., Liang, J., Li, D., Yu, H., Liu, G.: Hot topic detection based on a refined TF-IDF algorithm. IEEE Access 7, 26996–27007 (2019)CrossRef
14.
Zurück zum Zitat Fengling, W.: Research on hot topic discovery based on intelligent campus information service platform. In: The 3rd Information Technology, Networking, Electronic and Automation Control Conference. IEEE (2019) Fengling, W.: Research on hot topic discovery based on intelligent campus information service platform. In: The 3rd Information Technology, Networking, Electronic and Automation Control Conference. IEEE (2019)
15.
Zurück zum Zitat Xu, G., Meng, Y., Chen, Z., Qiu, X., Wang, C., Yao, H.: Research on topic detection and tracking for online news texts. IEEE Access 7, 58407–58418 (2019)CrossRef Xu, G., Meng, Y., Chen, Z., Qiu, X., Wang, C., Yao, H.: Research on topic detection and tracking for online news texts. IEEE Access 7, 58407–58418 (2019)CrossRef
16.
Zurück zum Zitat Wan, J., Zheng, P., Si, H., Xiong, N.N., Zhang, W., Vasilakos, A.V.: An artificial intelligence driven multi-feature extraction scheme for big data detection. IEEE Access 7, 80122–80132 (2019)CrossRef Wan, J., Zheng, P., Si, H., Xiong, N.N., Zhang, W., Vasilakos, A.V.: An artificial intelligence driven multi-feature extraction scheme for big data detection. IEEE Access 7, 80122–80132 (2019)CrossRef
17.
Zurück zum Zitat Ju, Q.: Large-scale structural reranking for hierarchical text categorization. Dissertation Ph.D. University of Trento, Italy (2013) Ju, Q.: Large-scale structural reranking for hierarchical text categorization. Dissertation Ph.D. University of Trento, Italy (2013)
18.
Zurück zum Zitat Della Vedova, M.L., Tacchini, E., Moret, S., Ballarin, G., DiPierro, M., De Alfaro, L.: Automatic online fake news detection combining content and social signals. In: the 22nd Conference of Open Innovations Association (FRUCT), pp. 272–279. IEEE (2018) Della Vedova, M.L., Tacchini, E., Moret, S., Ballarin, G., DiPierro, M., De Alfaro, L.: Automatic online fake news detection combining content and social signals. In: the 22nd Conference of Open Innovations Association (FRUCT), pp. 272–279. IEEE (2018)
19.
Zurück zum Zitat Elhadad, M.K., Li, K.F., Gebali, F.: Sentiment analysis of Arabic and English tweets. In: Workshops of the International Conference on Advanced Information Networking and Applications. Springer, Cham (2019) Elhadad, M.K., Li, K.F., Gebali, F.: Sentiment analysis of Arabic and English tweets. In: Workshops of the International Conference on Advanced Information Networking and Applications. Springer, Cham (2019)
20.
Zurück zum Zitat Xu, K., Wang, F., Wang, H., Yang, B.: Detecting fake news over online social media via domain reputations and content understanding. Tsinghua Sci. Technol 25(1), 20–27 (2019)CrossRef Xu, K., Wang, F., Wang, H., Yang, B.: Detecting fake news over online social media via domain reputations and content understanding. Tsinghua Sci. Technol 25(1), 20–27 (2019)CrossRef
21.
Zurück zum Zitat Elhadad, M.K., Badran, K.M., Salama, G.I.: A novel approach for ontology-based dimensionality reduction for web text document classification. In: 2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS). IEEE (2017) Elhadad, M.K., Badran, K.M., Salama, G.I.: A novel approach for ontology-based dimensionality reduction for web text document classification. In: 2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS). IEEE (2017)
22.
Zurück zum Zitat Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.: Scikit-learn: machine learning in Python. J. ML Res. 12, 2825–2830 (2011)MathSciNetMATH Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.: Scikit-learn: machine learning in Python. J. ML Res. 12, 2825–2830 (2011)MathSciNetMATH
23.
Zurück zum Zitat Wang, W.Y.: “liar, liar pants on fire”: a new benchmark dataset for fake news detection. arXiv preprint, arXiv:1705.00648 (2017) Wang, W.Y.: “liar, liar pants on fire”: a new benchmark dataset for fake news detection. arXiv preprint, arXiv:​1705.​00648 (2017)
24.
Zurück zum Zitat Salem, F.K.A., Al Feel, R., Elbassuoni, S., Jaber, M., Farah, M.: FA-KES: a fake news dataset around the Syrian war. In: Proceedings of the International AAAI Conference on Web and Social Media (ICWSM 2019). AAAI (2019) Salem, F.K.A., Al Feel, R., Elbassuoni, S., Jaber, M., Farah, M.: FA-KES: a fake news dataset around the Syrian war. In: Proceedings of the International AAAI Conference on Web and Social Media (ICWSM 2019). AAAI (2019)
25.
Zurück zum Zitat Elhadad, M.K., Badran, K.M., Salama, G.I.: A novel approach for ontology-based feature vector generation for web text document classification. Int. J. Softw. Innov. 6(1), 1–10 (2018)CrossRef Elhadad, M.K., Badran, K.M., Salama, G.I.: A novel approach for ontology-based feature vector generation for web text document classification. Int. J. Softw. Innov. 6(1), 1–10 (2018)CrossRef
Metadaten
Titel
A Novel Approach for Selecting Hybrid Features from Online News Textual Metadata for Fake News Detection
verfasst von
Mohamed K. Elhadad
Kin Fun Li
Fayez Gebali
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-33509-0_86