skip to main content
10.1145/1837885.1837890acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

The anatomy of a large-scale human computation engine

Published:25 July 2010Publication History

ABSTRACT

In this paper we describe Rabj, an engine designed to simplify collecting human input. We have used Rabj to collect over 2.3 million human judgments to augment data mining, data entry, and curation tasks at Freebase over the course of a year. We illustrate several successful applications that have used Rabj to collect human judgment. We describe how the architecture and design decisions of Rabj are affected by the constraints of content agnosticity, data freshness, latency and visibility. We present work aimed at increasing the yield and reliability of human computation efforts. Finally, we discuss empirical observations and lessons learned in the course of a year of operating the service.

References

  1. K. Bollacker, C. Evans, P. Paritosh, T. Sturge, and J. Taylor. Freebase: a collaboratively created graph database for structuring human knowledge. In SIGMOD '08: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pages 1247--1250, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Farber. Google's Marissa Mayer: Speed wins. CNET Between the Lines. http://blogs.zdnet.com/BTL/?p=3925, 2006.Google ScholarGoogle Scholar
  3. D. F. Galletta, R. Henry, S. Mccoy, and P. Polak. Web site delays: How tolerant are users? Journal of the Association for Information Systems, 5:1--28, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  4. A. Kittur, E. H. Chi, and B. Suh. Crowdsourcing user studies with Mechanical Turk. In CHI '08: Proceeding of the twenty-sixth annual SIGCHI conference on Human factors in computing systems, pages 453--456, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. V. S. Sheng, F. Provost, and P. G. Ipeirotis. Get another label? Improving data quality and data mining using multiple, noisy labelers. In KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 614--622, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. K. Siorpaes and M. Hepp. Games with a purpose for the semantic web. Intelligent Systems, IEEE, 23:50--60, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Snow, B. O'Connor, D. Jurafsky, and A. Y. Ng. Cheap and fast -- but is it good? Evaluating non-expert annotations for natural language tasks. In EMNLP, pages 254--263. ACL, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. L. von Ahn. Games with a purpose. IEE Computer Magazine, 39:92--94, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. L. von Ahn and L. Dabbish. Designing games with a purpose. Communications of the ACM, 51(8):58--67, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. L. von Ahn, M. Kedia, and M. Blum. Verbosity: a game for collecting common-sense facts. In Proceedings of ACM CHI 2006 Conference on Human Factors in Computing Systems, volume 1 of Games, pages 75--78. ACM Press, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. The anatomy of a large-scale human computation engine

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            HCOMP '10: Proceedings of the ACM SIGKDD Workshop on Human Computation
            July 2010
            95 pages
            ISBN:9781450302227
            DOI:10.1145/1837885

            Copyright © 2010 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 25 July 2010

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Upcoming Conference

            KDD '24

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader