research-article

Have you done anything like that?: predicting performance using inter-category reputation

Authors:
Marios Kokkodis

NYU Stern School of Business, New York, NY, USA

NYU Stern School of Business, New York, NY, USA
View Profile

,
Panagiotis G. Ipeirotis

NYU Stern school of business, New York, NY, USA

NYU Stern school of business, New York, NY, USA
View Profile

WSDM '13: Proceedings of the sixth ACM international conference on Web search and data miningFebruary 2013Pages 435–444https://doi.org/10.1145/2433396.2433450

Published:04 February 2013Publication History

WSDM '13: Proceedings of the sixth ACM international conference on Web search and data mining

Pages 435–444

ABSTRACT

Online labor markets such as oDesk and Amazon Mechanical Turk have been growing in importance over the last few years. In these markets, employers post tasks on which remote contractors work and deliver the product of their work. As in most online marketplaces, reputation mechanisms play a very important role in facilitating transactions, since they instill trust and are often predictive of the future satisfaction of the employer. However, labor markets are usually highly heterogeneous in terms of available task categories; in such scenarios, past performance may not be a representative signal of future performance. To account for this heterogeneity, in our work, we build models that predict the performance of a worker based on prior, category-specific feedback. Our models assume that each worker has a category-specific quality, which is latent and not directly observable; what is observable, though, is the set of feedback ratings of the worker and of other contractors with similar work histories. Based on this information, we build a multi-level, hierarchical scheme that deals effectively with the data sparseness, which is inherent in many cases of interest (i.e., contractors with relatively brief work histories). We evaluate our models on a large corpus of real transactional data from oDesk, an online labor market with hundreds of millions of dollars in transaction volume. Our results show an improved accuracy of up to 47% compared to the existing baseline.

References

L. Adamic, J. Zhang, E. Bakshy, and M. Ackerman. Knowledge sharing and yahoo answers: everyone knows something. In Proceedings of the 17th international conference on World Wide Web. ACM, 2008. Google ScholarDigital Library
E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. Finding high-quality content in social media. In Proceedings of the international conference on Web search and web data mining. ACM, 2008. Google ScholarDigital Library
J. Bian, Y. Liu, D. Zhou, E. Agichtein, and H. Zha. Learning to recognize reliable users and content in social media with coupled mutual reinforcement. In Proceedings of the 18th international conference on World Wide Web. ACM, 2009. Google ScholarDigital Library
R. Clemen and R. Winkler. Unanimity and compromise among probability forecasters. Management Science, 36(7), 1990.Google Scholar
C. Danescu-Niculescu-Mizil, G. Kossinets, J. Kleinberg, and L. Lee. How opinions are received by online communities: a case study on amazon. com helpfulness votes. In Proceedings of the 18th international conference on World Wide Web. ACM, 2009. Google ScholarDigital Library
C. Dellarocas. The digitization of word of mouth: Promise and challenges of online feedback mechanisms. Management science, 49(10), 2003. Google ScholarDigital Library
C. Dellarocas. Reputation mechanisms. Handbook on Economics and Information Systems, 2006.Google ScholarCross Ref
A. Gelman, J. Carlin, H. Stern, and D. Rubin. Bayesian Data Analysis. Chapman & Hall/CRC, 2004.Google Scholar
A. Ghose and P. G. Ipeirotis. Estimating the helpfulness and economic impact of product reviews: Mining text and reviewer characteristics. TKDE, 23(10), 2011. Google ScholarDigital Library
W. Greene. Econometric analysis. Prentice Hall, 2007.Google Scholar
R. Hambleton. Fundamentals of item response theory, volume 2. Sage Publications, Incorporated, 1991.Google Scholar
N. Hu, J. Zhang, and P. A. Pavlou. Overcoming the j-shaped distribution of product reviews. Commun. ACM, 52(10), 2009. Google ScholarDigital Library
J. Jeon, W. Croft, J. Lee, and S. Park. A framework to predict the quality of answers with non-textual features. In Proceedings of the 29th annual international conference on Research and development in information retrieval. ACM, 2006. Google ScholarDigital Library
S. Kim, P. Pantel, T. Chklovski, and M. Pennacchiotti. Automatically assessing review helpfulness. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, 2006. Google ScholarDigital Library
T. Lappas and D. Gunopulos. Efficient confident search in large review corpora. Machine Learning and Knowledge Discovery in Databases, 2010. Google ScholarDigital Library
Y. Liu, J. Bian, and E. Agichtein. Predicting information seeker satisfaction in community question answering. In Proceedings of the 31st annual international conference on Research and development in information retrieval. ACM, 2008. Google ScholarDigital Library
Y. Liu, X. Huang, A. An, and X. Yu. Modeling and predicting the helpfulness of online reviews. In Eighth IEEE International Conference on Data Mining, 2008. ICDM'08. IEEE, 2008. Google ScholarDigital Library
Y. Lu, P. Tsaparas, A. Ntoulas, and L. Polanyi. Exploiting social context for review quality prediction. In Proceedings of the 19th international conference on World Wide Web. ACM, 2010. Google ScholarDigital Library
P. Nelson. Information and consumer behavior. The Journal of Political Economy, 78(2), 1970.Google ScholarCross Ref
M. O'Mahony and B. Smyth. Using readability tests to predict helpful product reviews. In Adaptivity, Personalization and Fusion of Heterogeneous Information, 2010. Google ScholarDigital Library
J. Otterbacher. 'helpfulness' in online communities: a measure of message quality. In Proceedings of the 27th international conference on Human factors in computing systems. ACM, 2009. Google ScholarDigital Library
C. Shah and J. Pomerantz. Evaluating and predicting answer quality in community qa. In Proceeding of the 33rd international conference on Research and development in information retrieval. Citeseer, 2010. Google ScholarDigital Library
M. Suryanto, E. Lim, A. Sun, and R. Chiang. Quality-aware collaborative question answering: methods and evaluation. In Proceedings of the Second ACM International Conference on Web Search and Data Mining. ACM, 2009. Google ScholarDigital Library
P. Tsaparas, A. Ntoulas, and E. Terzi. Selecting a comprehensive set of reviews. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2011. Google ScholarDigital Library

Index Terms

Have you done anything like that?: predicting performance using inter-category reputation
1. Information systems

Recommendations

Dynamic Recommendations for Sequential Hiring Decisions in Online Labor Markets
KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Online labor markets facilitate transactions between employers and a diverse set of independent contractors around the globe. When making hiring decisions in these markets, employers have to assess a large and heterogeneous population of contractors. ...
Read More
Hiring Behavior Models for Online Labor Markets
WSDM '15: Proceedings of the Eighth ACM International Conference on Web Search and Data Mining

In an online labor marketplace employers post jobs, receive freelancer applications and make hiring decisions. These hiring decisions are based on the freelancer's observed (e.g., education) and latent (e.g., ability) characteristics. Because of the ...
Read More
Reputation Transferability in Online Labor Markets

Online workplaces such as oDesk, Amazon Mechanical Turk, and TaskRabbit have been growing in importance over the last few years. In such markets, employers post tasks on which remote contractors work and deliver the product of their work online. As in ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WSDM '13: Proceedings of the sixth ACM international conference on Web search and data mining
February 2013
816 pages
ISBN:9781450318693
DOI:10.1145/2433396
General Chairs:
Stefano Leonardi
Sapienza University of Rome, Italy
,
Alessandro Panconesi
Sapienza University of Rome, Italy
,
Program Chairs:
Paolo Ferragina
University of Pisa, Italy
,
Aristides Gionis
Yahoo! Research, Barcelona, Spain
Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 4 February 2013
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
bayesian modeling
online labor markets
reputation
task performance prediction
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate498of2,863submissions,17%
Upcoming Conference
WSDM '25

Sponsor:

sigir

sigir

sigir

sigir

The Eighteenth ACM International Conference on Web Search and Data Mining

April 7 - 11, 2025

Hannover , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 16
  Total Citations
  View Citations
- 275
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Have you done anything like that?: predicting performance using inter-category reputation

WSDM '13: Proceedings of the sixth ACM international conference on Web search and data mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Dynamic Recommendations for Sequential Hiring Decisions in Online Labor Markets

Hiring Behavior Models for Online Labor Markets

Reputation Transferability in Online Labor Markets