ABSTRACT
Our everyday observations about the behaviors of others around us shape how we decide to act or interact. In social media the ability to observe and interpret others' behavior is limited. This work describes one approach to leverage everyday behavioral observations to develop tools that could improve understanding and sense making capabilities of contributors, managers and researchers of social media systems. One example of behavioral observation is Wikipedia Barnstars. Barnstars are a type of award recognizing the activities of Wikipedia editors. We mine the entire English Wikipedia to extract barnstar observations. We develop a multi-label classifier based on a random forest technique to recognize and label distinct forms of observed and acknowledged activity. We evaluate the classifier through several means including use of separate training and testing datasets and the by application of the classifier to previously unlabeled data. We use the classifier to identify Wikipedia editors who have been observed with some predominant types of behavior and explore whether those patterns of behavior are evident and how observers seem to be making the observations. We discuss how these types of activity observations can be used to develop tools and potentially improve understanding and analysis in wikis and other online communities.
- Burke, M. and Kraut, B. 2008. Mopping up: modeling wikipedia promotion decisions. Proceedings of CSCW'08, 27--36. Google ScholarDigital Library
- Bryant, S. L., Forte, A. and Bruckman. A. 2005. Becoming Wikipedian: Transformation of Participation in a Collaborative Online Encyclopedia. Proceedings of GROUP'05. 1--10. Google ScholarDigital Library
- Cheng, W. and Hullermeier, E. 2009. Combining instance-based learning and logistic regression for multilabel classification. Machine Learning. 76(2-3):211--225. Google ScholarDigital Library
- Cosley, D., Frankowski, D., Terveen, L., and Riedl, J. 2007. SuggestBot: Using Intelligent Task Routing to Help People Find Work in Wikipedia. Proceedings of IUI. Google ScholarDigital Library
- Geiger, R. S., and Ribes, D. 2010 The work of Sustaining Order in Wikipedia: The Banning of a Vandal. Proceedings of CSCW'10, 117--126. Google ScholarDigital Library
- Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., and Witten, I. H. 2009. The WEKA Data Mining Software: An Update. SIGKDD Explorations, 11(1). Google ScholarDigital Library
- Krieger, M., Stark, E. M., and Klemmer, S. 2009. Coordinating tasks on the commons: designing for personal goals, expertise and serendipity. Proceedings of CHI'09. 1485--1494. Google ScholarDigital Library
- Kriplean, T., Beschastnikh I., and McDonald, D. W. 2008. Articulations of wikiwork: uncovering valued work in wikipedia through barnstars. Proceedings of CSCW'08. 47--56. Google ScholarDigital Library
- Luther, K., Caine, K., Ziegler, K., and Bruckman, A. 2010. Why It Works (When It Works): Success Factors in Online Creative Collaboration. Proceedings of GROUP'10. 1--10. Google ScholarDigital Library
- Sajnani, H., Javanmardi, S., McDonald, D. W., Lopes, C. V. 2011. Multi-Label Classification of Short Text: A Study on Wikipedia Barnstars. In the AAAI-11 Workshop on Analyzing Microtext.Google Scholar
- Strauss, A. 1985, Work and the Division of Labor. The Sociological Quarterly. 26, 1. 1--19.Google ScholarCross Ref
- Suh, B., Chi, E., Kittur, A., and Pendleton, B. 2008. Lifting the veil: improving accountability and social transparency in Wikipedia with wikidashboard. Proceeding of CHI'08. 1037--1040. Google ScholarDigital Library
- Tsoumakas, G., Vilcek, J., Spyromitros, E., Vlahavas, I. (2010) Mulan: A Java Library for Multi-Label Learning. Journal of Machine Learning Research 1, 1--48. Google ScholarDigital Library
- Tsoumakas, G. and Katakis, I. 2007. Multi-Label Classification: An Overview. International Journal of Data Warehousing and Mining, 3 (3) 1--13, 2007.Google ScholarCross Ref
- Ung, H. and Dalle. J. 2010. Project management in the Wikipedia community. Proceedings of the 6th International Symposium on Wikis and Open Collaboration (WikiSym '10). Google ScholarDigital Library
- Viegas, F. B., Wattenberg, M., and Dave, K. 2004. Studying Cooperation and Conflict Between Authors with History Flow Visualizations. In Proceedings of CHI'04. Google ScholarDigital Library
- Wattenberg, M., Viégas, F. B., and Hollenbach, K. 2007. Visualizing Activity on Wikipedia with Chromograms. Proceedings of INTERACT 2007. LNCS 4663, Part II. 272--287. Google ScholarDigital Library
- Zhang, M. L. and Zhou, Z. H. 2005. A k-Nearest Neighbor Based Algorithm for Multi-label Classification. Proceedings of the 1st IEEE International Conference on Granular Computing.Google Scholar
Index Terms
- Finding patterns in behavioral observations by automatically labeling forms of wikiwork in Barnstars
Recommendations
Automatically Labeling Low Quality Content on Wikipedia By Leveraging Patterns in Editing Behaviors
CSCW2Wikipedia articles aim to be definitive sources of encyclopedic content. Yet, only 0.6% of Wikipedia articles have high quality according to its quality scale due to insufficient number of Wikipedia editors and enormous number of articles. Supervised ...
Automatically Generating Wikipedia Info-boxes from Wikidata
WWW '18: Companion Proceedings of the The Web Conference 2018Info-boxes provide a summary of the most important meta-data relating to a particular entity described by a Wikipedia article. However, many articles have no info-box or have info-boxes with only minimal information; furthermore, there is a huge ...
Performance analysis of triple-frequency ambiguity resolution with BeiDou observations
We investigate triple-frequency ambiguity resolution performance using real BeiDou data. We test four ambiguity resolution (AR) methods which are applicable to triple-frequency observations. These are least squares ambiguity decorrelation adjustment (...
Comments