ABSTRACT
In this paper, we propose a method to identify good quality Wikipedia articles by mutually evaluating editors and texts. A major approach for assessing article quality is a text survival ratio based approach. In this approach, when a text survives beyond multiple edits, the text is assessed as good quality. This approach assumes that poor quality texts are deleted by editors with high possibility. However, many vandals delete good quality texts frequently, then the survival ratios of good quality texts are improperly decreased by vandals. As a result, many good quality texts are unfairly assessed as poor quality. In our method, we consider editor quality for calculating text quality, and decrease the impacts on text qualities by the vandals who has low quality. Using this improvement, the accuracy of the text quality should be improved. However, an inherent problem of this idea is that the editor qualities are calculated by the text qualities. To solve this problem, we mutually calculate the editor and text qualities until they converge. We did our experimental evaluation, and we confirmed that the proposed method could accurately assess the text qualities.
- B. T. Adler, K. Chatterjee, L. de Alfaro, M. Faella, I. Pye, and V. Raman. Assigning Trust to Wikipedia Content. In Proceedings of the International Symposium on Wikis (WikiSym '08). ACM, 2008. Google ScholarDigital Library
- B. T. Adler and L. de Alfaro. A content-driven reputation system for the Wikipedia. In Proceedings of the 16th international conference on World Wide Web (WWW '07), pages 261--270, 2007. Google ScholarDigital Library
- B. T. Adler, K. Chatterjee, L. de Alfaro, M. Faella, I. Pye, and V. Raman. Measuting Author Contributions to the Wikipedia. In Proceedings of the 2008 International Symposium on Wikis (WikiSym '08), 2008. Google ScholarDigital Library
- M. Hu, E. Lim, A. Sun, H. W. Lauw, and B. Vuong. Measuring Article Quality in Wikipedia: Models and Evaluation. In Proceedings of ACM International Conference on Information and Knowledge Management (CIKM 2007), pages 243--252, 2007. Google ScholarDigital Library
- D. M. Wilkinson and B. A. Huberman. Cooperation and quality in wikipedia. In Proceedings of the 2007 international symposium on Wikis (WikiSym '07), pages 157--164. ACM, 2007. Google ScholarDigital Library
- Jon M. Kleinberg. Authoritative sources in a hyperlinked environment. J. ACM, 46:604--632, 1999. Google ScholarDigital Library
- Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. In Proceedings of the International Conference on World Wide Web (WWW'97), 1997. Google ScholarDigital Library
- R. Lempel and S. Moran. SALSA: the stochastic approach for link-structure analysis. ACM Trans. Inf. Syst., 19(2):131--160, April 2001. Google ScholarDigital Library
- Monika Henzinger. Link analysis in web information retrieval. IEEE DATA ENGINEERING BULLETIN, 23:3--8, 2000.Google Scholar
- B. Stvilia, M. Twidale, L. Smith, and L. Gasser. Information quality work organization in wikipedia. J. Am. Soc. Inf. Sci. Technol., 59(6):983--1001, 2008. Google ScholarDigital Library
- B. Stvilia, L. Gasser, M. B. Twidale, and L. C. Smith. A Framework for Information Quality Assessment. Journal of the American Society for Information Science and Technology, 58(12):1720--1733, 2007. Google ScholarDigital Library
- M. Kramer, A. Gregorowicz, and B. Iyer. Wiki trust metrics based on phrasal analysis. In Proceedings of International Symposium on Wikis (WikiSym '08), 2008. Google ScholarDigital Library
- M. G. Siegler. Youtube comes to a 5-star realization: Its ratings are useless: http://www.techcrunch.com/2009/09/22/youtube-comes-to-a-5-star-realization-its-ratings-are-useless/, September 2009.Google Scholar
- T. Wöhner and R. Peters. Assessing the quality of Wikipedia articles with lifecycle based metrics. In Proceedings of the International Symposium on Wikis and Open Collaboration (WikiSym '09), 2009. Google ScholarDigital Library
- Reid Priedhorsky, Jilin Chen, Shyong (Tony) K. Lam, Katherine Panciera, Loren Terveen, and John Riedl. Creating, destroying, and restoring value in wikipedia. In Proceedings of the 2007 international ACM conference on Supporting group work, GROUP '07, pages 259--268, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
- Katherine Panciera, Aaron Halfaker, and Loren Terveen. Wikipedians are born, not made: a study of power editors on wikipedia. In Proceedings of the ACM 2009 international conference on Supporting group work, GROUP '09, pages 51--60, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
- A. Halfaker, A. Kittur, R. Kraut, and J. Riedl. A Jury of Your peers: Quality, Experience and Ownership in Wikipedia. In Proceedings of the International Symposium on Wikis and Open Collaboration (WikiSym '09), pages 1--10, 2009. Google ScholarDigital Library
- Peter Kin-Fong Fong and Robert P. Biuk-Aghai. What did they do? deriving high-level edit histories in wikis. In Proceedings of the 6th International Symposium on Wikis and Open Collaboration, WikiSym '10, pages 2:1--2:10, New York, NY, USA, 2010. ACM. Google ScholarDigital Library
- Ricardo Baeza-Yates and Berthier Ribeiro-Neto. Modern Information Retrieval: the concepts and technology behind search. Addison-Wesley, 2011. Google ScholarDigital Library
- E. G. Toms, T. Mackenzie, C. Jordan, and S. Hall. wikiSearch: enabling interactivity in search. In Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2009), page 843, 2009. Google ScholarDigital Library
- M. Sabel. Structuring Wiki revision history. In Proceedings of the 2007 international symposium on Wikis (WikiSym '07), pages 125--130. ACM, 2007. Google ScholarDigital Library
- Marti A. Hearst. Search User Interfaces. Cambridge University Press, 2009. Google ScholarDigital Library
- T. Holloway, M. Bozicevic, and K. Börner. Analyzing and visualizing the semantic coverage of Wikipedia and its authors. Complexity, 12(3):30--40, 2007. Google ScholarDigital Library
- B. Otjacques, M. Cornil, and F. Feltz. Visualizing Cooperative Activities with Ellimaps: The Case of Wikipedia. In Y. Luo, editor, Cooperative Design, Visualization, and Engineering (CDVE '09), volume 5738 of Lecture Notes in Computer Science, pages 44--51. Springer, 2009. Google ScholarDigital Library
Index Terms
- Mutual evaluation of editors and texts for assessing quality of Wikipedia articles
Recommendations
Assessing the Quality of Wikipedia Articles
ICMLSC '21: Proceedings of the 2021 5th International Conference on Machine Learning and Soft ComputingWikipedia is a very important information reference source for the Internet users. Due to the fact that the content of Wikipedia is the collaborative result from a massive number of participants all over the world, the quality of Wikipedia might be ...
Assessing quality score of Wikipedia article using mutual evaluation of editors and texts
CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge ManagementIn this paper, we propose a method for assessing quality scores of Wikipedia articles by mutually evaluating editors and texts. Survival ratio based approach is a major approach to assessing article quality. In this approach, when a text survives beyond ...
On measuring the quality of Wikipedia articles
WICOW '10: Proceedings of the 4th workshop on Information credibilityThis paper discusses an approach to modeling and measuring information quality of Wikipedia articles. The approach is based on the idea that the quality of Wikipedia articles with distinctly different profiles needs to be measured using different ...
Comments