Skip to main content
Top

2018 | OriginalPaper | Chapter

Automated Credibility Assessment of Web Page Based on Genre

Authors : Shriyansh Agrawal, S. Lalit Mohan, Y. Raghu Reddy

Published in: Big Data Analytics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

With more than a billion web sites, volume and variety of content available for consumption is huge. However, credibility, an important quality characteristic of web pages is questionable in many cases and tends to be non-uniform. Credibility can increase or reduce the importance of web page leading to potential gain or loss of user base. Credibility without factoring genre of content (for example, Help, Article, Discussion, etc.) can lead to incorrect assessment. Depending on the genre, the importance of features such as web page date time modified, grammar, image to text ratio, in and out links, and other web page features differ. We propose a genre credibility assessment based on web page surface features and their importance in a genre. Further, we built a WEBCred framework to assess GCS (Genre based Credibility Score) with flexibility to add/modify genres, its features and their importance. We validated our approach on 10,429 ‘Information Security’ related web pages; the assessed score correlated 35% with crowd sourced Web Of Trust (WOT) score and 39% with Alexa ranking.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Aggarwal, S., Herre, V.O., Reddy, Y.R., Indurkhya, B.: Providing web credibility assessment support. In: Proceedings of the 2014 European Conference on Cognitive Ergonomics (2014) Aggarwal, S., Herre, V.O., Reddy, Y.R., Indurkhya, B.: Providing web credibility assessment support. In: Proceedings of the 2014 European Conference on Cognitive Ergonomics (2014)
10.
go back to reference Chen, G., Choi, B.: Web page genre classification. In: Proceedings of the Symposium on Applied Computing (2008) Chen, G., Choi, B.: Web page genre classification. In: Proceedings of the Symposium on Applied Computing (2008)
12.
go back to reference Fogg, B.J., Tseng, H.: The elements of computer credibility. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (1999) Fogg, B.J., Tseng, H.: The elements of computer credibility. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (1999)
13.
go back to reference Fogg, B.J., Soohoo, C., Danielson, D.R., Marable, L., Stanford, J., Tauber, E.R.: How do users evaluate the credibility of web sites? A study with over 2,500 participants. In: Proceedings of the Conference on Designing for User Experiences (2003) Fogg, B.J., Soohoo, C., Danielson, D.R., Marable, L., Stanford, J., Tauber, E.R.: How do users evaluate the credibility of web sites? A study with over 2,500 participants. In: Proceedings of the Conference on Designing for User Experiences (2003)
14.
go back to reference Hilligoss, B., Rieh, S.Y.: Developing a unifying framework of credibility assessment: construct, heuristics, and interaction in context. Inf. Process. Manage. 44, 1467–1484 (2008)CrossRef Hilligoss, B., Rieh, S.Y.: Developing a unifying framework of credibility assessment: construct, heuristics, and interaction in context. Inf. Process. Manage. 44, 1467–1484 (2008)CrossRef
15.
go back to reference Hovland, C.I., Weiss, W.: The influence of source credibility on communication effectiveness. Public Opin. Q. 15, 635–650 (1951)CrossRef Hovland, C.I., Weiss, W.: The influence of source credibility on communication effectiveness. Public Opin. Q. 15, 635–650 (1951)CrossRef
16.
go back to reference Iding, M.K., Crosby, M.E., Auernheimer, B., Barbara Klemm, E.: Web site credibility: why do people believe what they believe? J. Instr. Sci. 37, 43–63 (2009)CrossRef Iding, M.K., Crosby, M.E., Auernheimer, B., Barbara Klemm, E.: Web site credibility: why do people believe what they believe? J. Instr. Sci. 37, 43–63 (2009)CrossRef
17.
go back to reference Ipeirotis, P.G.: Demographics of Mechanical Turk (2010) Ipeirotis, P.G.: Demographics of Mechanical Turk (2010)
18.
go back to reference Ivory, M.Y., Hearst, M.A.: Statistical profiles of highly-rated web sites. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (2002) Ivory, M.Y., Hearst, M.A.: Statistical profiles of highly-rated web sites. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (2002)
19.
go back to reference Jones, K.S.: A look back and a look forward. In: Proceedings of the 11th Annual International Conference on Research and Development in Information Retrieval (1988) Jones, K.S.: A look back and a look forward. In: Proceedings of the 11th Annual International Conference on Research and Development in Information Retrieval (1988)
21.
go back to reference Crowston, K.: Reproduced and emergent genres of communication on the world wide web. In: Proceedings of the Thirtieth Hawaii International Conference on System Sciences (1997) Crowston, K.: Reproduced and emergent genres of communication on the world wide web. In: Proceedings of the Thirtieth Hawaii International Conference on System Sciences (1997)
22.
go back to reference Lazar, J., Meiselwitz, G., Feng, J.: Understanding web credibility: a synthesis of the research literature. Found. Trends Hum. Comput. Interact. 1, 139–202 (2007)CrossRef Lazar, J., Meiselwitz, G., Feng, J.: Understanding web credibility: a synthesis of the research literature. Found. Trends Hum. Comput. Interact. 1, 139–202 (2007)CrossRef
23.
go back to reference Metzger, M.J., Flanagin, A.J.: Credibility and trust of information in online environments: the use of cognitive heuristics. J. Pragmat. 59, 210–220 (2013)CrossRef Metzger, M.J., Flanagin, A.J.: Credibility and trust of information in online environments: the use of cognitive heuristics. J. Pragmat. 59, 210–220 (2013)CrossRef
25.
go back to reference Oakleaf, M.: Writing information literacy assessment plans: a guide to best practice. Commun. Inf. Lit. 3, 4 (2010) Oakleaf, M.: Writing information literacy assessment plans: a guide to best practice. Commun. Inf. Lit. 3, 4 (2010)
26.
go back to reference O’Grady, L.: Future directions for depicting credibility in health care web sites. Int. J. Med. Inform. 75, 58–65 (2006)CrossRef O’Grady, L.: Future directions for depicting credibility in health care web sites. Int. J. Med. Inform. 75, 58–65 (2006)CrossRef
28.
go back to reference Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: bringing order to the web. Technical report (1999) Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: bringing order to the web. Technical report (1999)
29.
go back to reference Pihur, V., Datta, S., Datta, S.: RankAggreg, an R package for weighted rank aggregation. J. BMC Bioinform. 10, 62 (2009)CrossRef Pihur, V., Datta, S., Datta, S.: RankAggreg, an R package for weighted rank aggregation. J. BMC Bioinform. 10, 62 (2009)CrossRef
30.
go back to reference Pollach, I.: Electronic word of mouth: a genre analysis of product reviews on consumer opinion web sites. In: Proceedings of the 39th Annual Hawaii International Conference on System Sciences (2006) Pollach, I.: Electronic word of mouth: a genre analysis of product reviews on consumer opinion web sites. In: Proceedings of the 39th Annual Hawaii International Conference on System Sciences (2006)
31.
go back to reference Sanagavarapu, L.M., Sarangi, S., Reddy, Y.R., Varma, V.: Fine grained approach for domain specific seed URL extraction. In: Proceedings of the 38th Annual Hawaii International Conference on System Sciences (2018) Sanagavarapu, L.M., Sarangi, S., Reddy, Y.R., Varma, V.: Fine grained approach for domain specific seed URL extraction. In: Proceedings of the 38th Annual Hawaii International Conference on System Sciences (2018)
32.
go back to reference Santini, M., Power, R., Evans, R.: Implementing a characterization of genre for automatic genre identification of web pages. In: 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (2006) Santini, M., Power, R., Evans, R.: Implementing a characterization of genre for automatic genre identification of web pages. In: 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (2006)
35.
go back to reference Sundar, S.S.: The MAIN model: a heuristic approach to understanding technology effects on credibility. J. Digit. Media Youth Credibility 73100, 78–92 (2007) Sundar, S.S.: The MAIN model: a heuristic approach to understanding technology effects on credibility. J. Digit. Media Youth Credibility 73100, 78–92 (2007)
36.
go back to reference Wathen, C.N., Burkell, J.: Believe it or not: factors influencing credibility on the web. J. Am. Soc. Inf. Sci. Technol. 53, 134–144 (2002)CrossRef Wathen, C.N., Burkell, J.: Believe it or not: factors influencing credibility on the web. J. Am. Soc. Inf. Sci. Technol. 53, 134–144 (2002)CrossRef
37.
go back to reference Yamamoto, Y., Tanaka, K.: Enhancing credibility judgment of web search results. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (2011) Yamamoto, Y., Tanaka, K.: Enhancing credibility judgment of web search results. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (2011)
Metadata
Title
Automated Credibility Assessment of Web Page Based on Genre
Authors
Shriyansh Agrawal
S. Lalit Mohan
Y. Raghu Reddy
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-04780-1_11

Premium Partner