ABSTRACT
Online labor markets such as oDesk and Amazon Mechanical Turk have been growing in importance over the last few years. In these markets, employers post tasks on which remote contractors work and deliver the product of their work. As in most online marketplaces, reputation mechanisms play a very important role in facilitating transactions, since they instill trust and are often predictive of the future satisfaction of the employer. However, labor markets are usually highly heterogeneous in terms of available task categories; in such scenarios, past performance may not be a representative signal of future performance. To account for this heterogeneity, in our work, we build models that predict the performance of a worker based on prior, category-specific feedback. Our models assume that each worker has a category-specific quality, which is latent and not directly observable; what is observable, though, is the set of feedback ratings of the worker and of other contractors with similar work histories. Based on this information, we build a multi-level, hierarchical scheme that deals effectively with the data sparseness, which is inherent in many cases of interest (i.e., contractors with relatively brief work histories). We evaluate our models on a large corpus of real transactional data from oDesk, an online labor market with hundreds of millions of dollars in transaction volume. Our results show an improved accuracy of up to 47% compared to the existing baseline.
- L. Adamic, J. Zhang, E. Bakshy, and M. Ackerman. Knowledge sharing and yahoo answers: everyone knows something. In Proceedings of the 17th international conference on World Wide Web. ACM, 2008. Google ScholarDigital Library
- E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. Finding high-quality content in social media. In Proceedings of the international conference on Web search and web data mining. ACM, 2008. Google ScholarDigital Library
- J. Bian, Y. Liu, D. Zhou, E. Agichtein, and H. Zha. Learning to recognize reliable users and content in social media with coupled mutual reinforcement. In Proceedings of the 18th international conference on World Wide Web. ACM, 2009. Google ScholarDigital Library
- R. Clemen and R. Winkler. Unanimity and compromise among probability forecasters. Management Science, 36(7), 1990.Google Scholar
- C. Danescu-Niculescu-Mizil, G. Kossinets, J. Kleinberg, and L. Lee. How opinions are received by online communities: a case study on amazon. com helpfulness votes. In Proceedings of the 18th international conference on World Wide Web. ACM, 2009. Google ScholarDigital Library
- C. Dellarocas. The digitization of word of mouth: Promise and challenges of online feedback mechanisms. Management science, 49(10), 2003. Google ScholarDigital Library
- C. Dellarocas. Reputation mechanisms. Handbook on Economics and Information Systems, 2006.Google ScholarCross Ref
- A. Gelman, J. Carlin, H. Stern, and D. Rubin. Bayesian Data Analysis. Chapman & Hall/CRC, 2004.Google Scholar
- A. Ghose and P. G. Ipeirotis. Estimating the helpfulness and economic impact of product reviews: Mining text and reviewer characteristics. TKDE, 23(10), 2011. Google ScholarDigital Library
- W. Greene. Econometric analysis. Prentice Hall, 2007.Google Scholar
- R. Hambleton. Fundamentals of item response theory, volume 2. Sage Publications, Incorporated, 1991.Google Scholar
- N. Hu, J. Zhang, and P. A. Pavlou. Overcoming the j-shaped distribution of product reviews. Commun. ACM, 52(10), 2009. Google ScholarDigital Library
- J. Jeon, W. Croft, J. Lee, and S. Park. A framework to predict the quality of answers with non-textual features. In Proceedings of the 29th annual international conference on Research and development in information retrieval. ACM, 2006. Google ScholarDigital Library
- S. Kim, P. Pantel, T. Chklovski, and M. Pennacchiotti. Automatically assessing review helpfulness. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, 2006. Google ScholarDigital Library
- T. Lappas and D. Gunopulos. Efficient confident search in large review corpora. Machine Learning and Knowledge Discovery in Databases, 2010. Google ScholarDigital Library
- Y. Liu, J. Bian, and E. Agichtein. Predicting information seeker satisfaction in community question answering. In Proceedings of the 31st annual international conference on Research and development in information retrieval. ACM, 2008. Google ScholarDigital Library
- Y. Liu, X. Huang, A. An, and X. Yu. Modeling and predicting the helpfulness of online reviews. In Eighth IEEE International Conference on Data Mining, 2008. ICDM'08. IEEE, 2008. Google ScholarDigital Library
- Y. Lu, P. Tsaparas, A. Ntoulas, and L. Polanyi. Exploiting social context for review quality prediction. In Proceedings of the 19th international conference on World Wide Web. ACM, 2010. Google ScholarDigital Library
- P. Nelson. Information and consumer behavior. The Journal of Political Economy, 78(2), 1970.Google ScholarCross Ref
- M. O'Mahony and B. Smyth. Using readability tests to predict helpful product reviews. In Adaptivity, Personalization and Fusion of Heterogeneous Information, 2010. Google ScholarDigital Library
- J. Otterbacher. 'helpfulness' in online communities: a measure of message quality. In Proceedings of the 27th international conference on Human factors in computing systems. ACM, 2009. Google ScholarDigital Library
- C. Shah and J. Pomerantz. Evaluating and predicting answer quality in community qa. In Proceeding of the 33rd international conference on Research and development in information retrieval. Citeseer, 2010. Google ScholarDigital Library
- M. Suryanto, E. Lim, A. Sun, and R. Chiang. Quality-aware collaborative question answering: methods and evaluation. In Proceedings of the Second ACM International Conference on Web Search and Data Mining. ACM, 2009. Google ScholarDigital Library
- P. Tsaparas, A. Ntoulas, and E. Terzi. Selecting a comprehensive set of reviews. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2011. Google ScholarDigital Library
Index Terms
- Have you done anything like that?: predicting performance using inter-category reputation
Recommendations
Dynamic Recommendations for Sequential Hiring Decisions in Online Labor Markets
KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data MiningOnline labor markets facilitate transactions between employers and a diverse set of independent contractors around the globe. When making hiring decisions in these markets, employers have to assess a large and heterogeneous population of contractors. ...
Hiring Behavior Models for Online Labor Markets
WSDM '15: Proceedings of the Eighth ACM International Conference on Web Search and Data MiningIn an online labor marketplace employers post jobs, receive freelancer applications and make hiring decisions. These hiring decisions are based on the freelancer's observed (e.g., education) and latent (e.g., ability) characteristics. Because of the ...
Reputation Transferability in Online Labor Markets
Online workplaces such as oDesk, Amazon Mechanical Turk, and TaskRabbit have been growing in importance over the last few years. In such markets, employers post tasks on which remote contractors work and deliver the product of their work online. As in ...
Comments