ABSTRACT
Readability is one of key factors determining document quality and reader's satisfaction. In this paper we analyze readability of Wikipedia, which is a popular source of information for searchers about unknown topics. Although Wikipedia articles are frequently listed by search engines on top ranks, they are often too difficult for average readers searching information about difficult queries. We examine the average readability of content in Wikipedia and compare it to the one in Simple Wikipedia and Britannica. Next, we investigate readability of selected categories in Wikipedia. Apart from standard readability measures we use some new metrics based on words' popularity and their distributions across different document genres and topics.
- J. Blumenstock. Automatically Assessing the Quality of Wikipedia Articles, Recent works, School of Information, UC Berkeley, 2008Google Scholar
- M. Coleman and T. L. Liau. A computer readability formula designed for machine scoring, Journal of Applied Psychology, Vol. 60, pp. 283--284, 1975.Google ScholarCross Ref
- K. Collins-Thompson and J. P. Callan. A language modeling approach to predicting reading difficulty. In HLT-NAACL 2004Google Scholar
- E. Dale and J.S. Chall. The concept of readability, Elementary English, 26(23), 1949Google Scholar
- K. Ehmann, A. Large, and J. Beheshti. Collaboration in Context: Comparing Article Evolution among Subject Disciplines in Wikipedia, First Monday, Volume 13 Number 10, 2008.Google ScholarCross Ref
- L. Feng, N. Elhadad, and M. Huenerfauth. Cognitively Motivated Features for Readability Assessment. In ECCL, 2009 Google ScholarDigital Library
- R. Flesch. A new readability yardstick, Journal of Applied Psychology, 1948, 32(3), pp. 221--233.Google ScholarCross Ref
- J. Giles. Internet encyclopaedias go head to head, Nature 438, 900--901, 2005Google ScholarCross Ref
- D.I. Shalowitz and S. Wolf. Shared decision-making and the lower literate patient. Journal of law, medicine & ethics, 32, 759--64, 2004Google Scholar
- J. Ure. Lexical density and register differentiation, London: Cambridge University Press, 443--452, 1971Google Scholar
Index Terms
- Is wikipedia too difficult?: comparative analysis of readability of wikipedia, simple wikipedia and britannica
Recommendations
Assessing the Readability of Web Search Results for Searchers with Dyslexia
SIGIR '18: The 41st International ACM SIGIR Conference on Research & Development in Information RetrievalStandards organizations, (e.g., the World Wide Web Consortium), are placing increased importance on the cognitive accessibility of online systems, including web search. Previous work has shown an association between query-document relevance judgments, ...
Easiest-first search: towards comprehension-based web search
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge managementAlthough Web search engines have become information gateways to the Internet, for queries containing technical terms, search results often contain pages that are difficult to be understood by non-expert users. Therefore, re-ranking search results in a ...
Readability of Wikipedia pages on andrology and gynecology: comparative study
AbstractIn this era of modern information and communication technology, the Internet is rapidly being used for health-related information. According to recent studies and trends, the Internet is a main source of health information. Wikipedia is a free ...
Comments