research-article

Finding patterns in behavioral observations by automatically labeling forms of wikiwork in Barnstars

Authors:
David W. McDonald, University of Washington
Sara Javanmardi, University of California Irvine
Mark Zachry, University of Washington

WikiSym '11: Proceedings of the 7th International Symposium on Wikis and Open Collaboration, October 2011, Pages 15–24, https://doi.org/10.1145/2038558.2038562

Published: 03 October 2011

ABSTRACT

Our everyday observations about the behaviors of others around us shape how we decide to act or interact. In social media, the ability to observe and interpret others' behavior is limited. This work describes one approach to leveraging everyday behavioral observations to develop tools that could improve the understanding and sense-making capabilities of contributors, managers, and researchers of social media systems. One example of behavioral observation is the Wikipedia Barnstar. Barnstars are a type of award recognizing the activities of Wikipedia editors. We mine the entire English Wikipedia to extract barnstar observations. We develop a multi-label classifier based on a random forest technique to recognize and label distinct forms of observed and acknowledged activity. We evaluate the classifier through several means, including the use of separate training and testing datasets and the application of the classifier to previously unlabeled data. We use the classifier to identify Wikipedia editors who have been observed with some predominant types of behavior, and we explore whether those patterns of behavior are evident and how observers seem to be making the observations. We discuss how these types of activity observations can be used to develop tools and potentially improve understanding and analysis in wikis and other online communities.
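
As a rough illustration of what evaluating a multi-label classifier on a held-out test set can involve, the Python sketch below compares predicted label sets with reference label sets using Hamming loss and exact-match (subset) accuracy. The label names, the sample data, and the choice of these two metrics are assumptions made for illustration only; the abstract does not state which label set or metrics the authors used, and the random forest classifier itself is not reproduced here.

```python
# Illustrative sketch only: the label names and data below are hypothetical
# and are not taken from the paper.
LABELS = ["editing", "social", "border_patrol", "administrative"]


def hamming_loss(true_sets, predicted_sets, labels=LABELS):
    """Fraction of (item, label) decisions where prediction and reference disagree."""
    if len(true_sets) != len(predicted_sets):
        raise ValueError("true_sets and predicted_sets must have the same length")
    if not true_sets:
        raise ValueError("at least one item is required")
    mismatches = 0
    for truth, predicted in zip(true_sets, predicted_sets):
        # Symmetric difference = labels wrongly added plus labels missed.
        mismatches += len(set(truth) ^ set(predicted))
    return mismatches / (len(true_sets) * len(labels))


def subset_accuracy(true_sets, predicted_sets):
    """Fraction of items whose predicted label set equals the reference set exactly."""
    if len(true_sets) != len(predicted_sets):
        raise ValueError("true_sets and predicted_sets must have the same length")
    if not true_sets:
        raise ValueError("at least one item is required")
    exact = sum(
        1 for truth, predicted in zip(true_sets, predicted_sets)
        if set(truth) == set(predicted)
    )
    return exact / len(true_sets)


# Hypothetical held-out test items: each barnstar may carry several labels.
reference = [
    {"editing"},
    {"editing", "social"},
    {"border_patrol"},
    {"administrative", "social"},
]
predicted = [
    {"editing"},
    {"editing"},                          # misses "social"
    {"border_patrol", "administrative"},  # adds a spurious label
    {"administrative", "social"},
]

loss = hamming_loss(reference, predicted)
accuracy = subset_accuracy(reference, predicted)
print(f"Hamming loss: {loss}")
print(f"Subset accuracy: {accuracy}")
```

Hamming loss gives partial credit when a prediction gets some labels right, whereas subset accuracy counts an item as correct only when every label matches, so reporting both gives a fuller picture of multi-label performance.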

Index Terms

1. Human-centered computing
  1. Collaborative and social computing
    1. Collaborative and social computing theory, concepts and paradigms
      1. Computer supported cooperative work
2. Social and professional topics
  1. Professional topics
    1. Computing and business
      1. Computer supported cooperative work

Published in
WikiSym '11: Proceedings of the 7th International Symposium on Wikis and Open Collaboration
October 2011
245 pages
ISBN:9781450309097
DOI:10.1145/2038558
Conference Chair: Felipe Ortega, University Rey Juan Carlos, Madrid, Spain
Program Chair: Andrea Forte, Drexel University, Philadelphia
Copyright © 2011 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
Wikipedia
behavioral patterns
multi-label learning
Acceptance Rates
Overall Acceptance Rate: 69 of 145 submissions, 48%