research-article

Trust, but verify: predicting contribution quality for knowledge base construction and curation

Authors:
Chun How Tan

Google, Mountain View, USA

Google, Mountain View, USA
View Profile

,
Eugene Agichtein

Emory University, Google, Atlanta, USA

Emory University, Google, Atlanta, USA
View Profile

,
Panos Ipeirotis

New York University, Google, New York, USA

New York University, Google, New York, USA
View Profile

,
Evgeniy Gabrilovich

Google, Mountain View, USA

Google, Mountain View, USA
View Profile

WSDM '14: Proceedings of the 7th ACM international conference on Web search and data miningFebruary 2014Pages 553–562https://doi.org/10.1145/2556195.2556227

Published:24 February 2014Publication History

WSDM '14: Proceedings of the 7th ACM international conference on Web search and data mining

Pages 553–562

ABSTRACT

The largest publicly available knowledge repositories, such as Wikipedia and Freebase, owe their existence and growth to volunteer contributors around the globe. While the majority of contributions are correct, errors can still creep in, due to editors' carelessness, misunderstanding of the schema, malice, or even lack of accepted ground truth. If left undetected, inaccuracies often degrade the experience of users and the performance of applications that rely on these knowledge repositories. We present a new method, CQUAL, for automatically predicting the quality of contributions submitted to a knowledge base. Significantly expanding upon previous work, our method holistically exploits a variety of signals, including the user's domains of expertise as reflected in her prior contribution history, and the historical accuracy rates of different types of facts. In a large-scale human evaluation, our method exhibits precision of 91% at 80% recall. Our model verifies whether a contribution is correct immediately after it is submitted, significantly alleviating the need for post-submission human reviewing.

References

L. A. Adamic, J. Zhang, E. Bakshy, and M. S. Ackerman. Knowledge sharing and yahoo answers: Everyone knows something. In WWW, 2008. Google ScholarDigital Library
B. T. Adler, L. de Alfaro, I. Pye, and V. Raman. Measuring author contributions to the wikipedia. In 4th Int'l Symposium on Wikis. ACM, 2008. Google ScholarDigital Library
E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. Finding high-quality content in social media. In WSDM, 2008. Google ScholarDigital Library
A. Anderson, D. Huttenlocher, J. Kleinberg, and J. Leskovec. Steering user behavior with badges. In WWW, pages 95--106, 2013. Google ScholarDigital Library
Y. Bachrach, T. Graepel, T. Minka, and J. Guiver. How to grade a test without knowing the answers|a bayesian graphical model for adaptive crowdsourcing and aptitude testing. arXiv preprint arXiv:1206.6386, 2012.Google Scholar
J. Bian, Y. Liu, D. Zhou, E. Agichtein, and H. Zha. Learning to recognize reliable users and content in social media with coupled mutual reinforcement. In WWW, 2009. Google ScholarDigital Library
C. Bizer, J. Lehmann, G. Kobilarov, S. Auer, C. Becker, R. Cyganiak, and S. Hellmann. Dbpedia - a crystallization point for the web of data. Web Semantics, 7(3):154--165, Sept. 2009. Google ScholarDigital Library
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. JMLR, 3:993--1022, 2003. Google ScholarDigital Library
C. Dellarocas. The digitization of word of mouth: Promise and challenges of online feedback mechanisms. Management Science, 2003. Google ScholarDigital Library
C. Dellarocas. Reputation mechanisms. In Handbook on Economics and Information Systems. Elsevier Publishing, 2006.Google Scholar
O. Deshpande, D. S. Lamba, M. Tourn, S. Das, S. Subramaniam, A. Rajaraman, V. Harinarayan, and A. Doan. Building, maintaining, and using knowledge bases: A report from the trenches. In SIGMOD, 2013. Google ScholarDigital Library
J. Duchi and Y. Singer. Boosting with structural sparsity. In ICML, pages 297--304, 2009. Google ScholarDigital Library
S. Dumais and H. Chen. Hierarchical classification of web content. In SIGIR'00, pages 256--263, 2000. Google ScholarDigital Library
S. E. Embretson and S. P. Reise. Item response theory. Psychology Press, 2000.Google ScholarCross Ref
Y. Freund and R. E. Schapire. Large margin classification using the perceptron algorithm. Machine learning, 37(3):277--296, 1999. Google ScholarDigital Library
A. Galland, S. Abiteboul, A. Marian, and P. Senellart. Corroborating information from disagreeing views. In WSDM, 2010. Google ScholarDigital Library
A. Halfaker, A. Kittur, R. Kraut, and J. Riedl. A jury of your peers: quality, experience and ownership in wikipedia. In 5th Int'l Symposium on Wikis and Open Collaboration, 2009. Google ScholarDigital Library
J. Hoffart, F. M. Suchanek, K. Berberich, and G. Weikum. YAGO2: A spatially and temporally enhanced knowledge base from wikipedia. Artificial Intelligence, 194:28--61, Jan. 2013. Google ScholarDigital Library
P. G. Ipeirotis, F. Provost, and J. Wang. Quality management on amazon mechanical turk. In KDD Workshop on Human computation, pages 64--67, 2010. Google ScholarDigital Library
J. Jeon, W. B. Croft, J. H. Lee, and S. Park. A framework to predict the quality of answers with non-textual features. In SIGIR, 2006. Google ScholarDigital Library
S. Kochhar, S. Mazzocchi, and P. Paritosh. The anatomy of a large-scale human computation engine. In KDD Workshop on Human Computation, pages 10--17, 2010. Google ScholarDigital Library
M. Kokkodis and P. G. Ipeirotis. Have you done anything like that?: predicting performance using inter-category reputation. In WSDM, pages 435--444, 2013. Google ScholarDigital Library
D. Koller and M. Sahami. Hierarchically classifying documents using very few words. In ICML, pages 170--178, 1997. Google ScholarDigital Library
Y. Liu, J. Bian, and E. Agichtein. Predicting information seeker satisfaction in community question answering. In SIGIR, 2008. Google ScholarDigital Library
V. C. Raykar, S. Yu, L. H. Zhao, G. H. Valadez, C. Florin, L. Bogoni, and L. Moy. Learning from crowds. JMLR, 99:1297--1322, 2010. Google ScholarDigital Library
M. E. Ruiz and P. Srinivasan. Hierarchical text categorization using neural networks. Information Retrieval, 5:87--118, 2002. Google ScholarDigital Library
J. Rzeszotarski and A. Kittur. Learning from history: predicting reverted work at the word level in wikipedia. In Computer Supported Cooperative Work, pages 437--440, 2012. Google ScholarDigital Library
J. M. Rzeszotarski and A. Kittur. Instrumenting the crowd: using implicit behavioral measures to predict task performance. In Annual symposium on User interface software and technology, pages 13--22. ACM, 2011. Google ScholarDigital Library
C. Shah and J. Pomerantz. Evaluating and predicting answer quality in community QA. In SIGIR, 2010. Google ScholarDigital Library
V. S. Sheng, F. Provost, and P. G. Ipeirotis. Get another label? improving data quality and data mining using multiple, noisy labelers. In KDD, pages 614--622. ACM, 2008. Google ScholarDigital Library
R. Snow, B. O'Connor, D. Jurafsky, and A. Y. Ng. Cheap and fast|but is it good?: evaluating non-expert annotations for natural language tasks. In EMNLP, pages 254--263. Association for Computational Linguistics, 2008. Google ScholarDigital Library
M. A. Suryanto, E.-P. Lim, and A. S. R. H. L. Chiang. Quality-aware collaborative question answering: Methods and evaluation. In WSDM, 2009. Google ScholarDigital Library
F. B. Viégas, M. Wattenberg, and K. Dave. Studying cooperation and conict between authors with history flow visualizations. In CHI, pages 575--582, 2004. Google ScholarDigital Library
E. M. Voorhees. Overview of trec 2003. In TREC, pages 1--13, 2003.Google Scholar
P. Welinder, S. Branson, S. Belongie, and P. Perona. The multidimensional wisdom of crowds. NIPS, 23:2424--2432, 2010.Google Scholar
M. Wick, K. Schultz, and A. McCallum. Human-machine cooperation with epistemological dbs: supporting user corrections to knowledge bases. In AKBC Workshop, pages 89--94. ACL, 2012. Google ScholarDigital Library

Index Terms

Trust, but verify: predicting contribution quality for knowledge base construction and curation
1. Information systems
  1. Information systems applications

Recommendations

Surface Name Errors in Wikipedia
CODS-COMAD '23: Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD)

Surface name is the string used to refer to an entity in a text corpus. Crowd-sourced knowledge repositories such as Wikipedia can have multiple types of errors, including surface name errors. This paper focuses on identifying and correcting surface ...
Read More
LONLIES: Estimating Property Values for Long Tail Entities
SIGIR '16: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval

Web search engines often retrieve answers for queries about popular entities from a growing knowledge base that is populated by a continuous information extraction process. However, less popular entities are not frequently mentioned on the web and are ...
Read More
HighLife: Higher-arity Fact Harvesting
WWW '18: Proceedings of the 2018 World Wide Web Conference

Text-based knowledge extraction methods for populating knowledge bases have focused on binary facts: relationships between two entities. However, in advanced domains such as health, it is often crucial to consider ternary and higher-arity relations. An ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WSDM '14: Proceedings of the 7th ACM international conference on Web search and data mining
February 2014
712 pages
ISBN:9781450323512
DOI:10.1145/2556195
General Chairs:
Ben Carterette
University of Delaware, USA
,
Fernando Diaz
Microsoft Research, USA
,
Program Chairs:
Carlos Castillo
Qatar Computing Research Institute, Qatar
,
Donald Metzler
Google, USA
Copyright © 2014 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 24 February 2014
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
crowdsourcing
knowledge base construction
predicting contribution quality
Qualifiers
- research-article
Conference

Acceptance Rates
WSDM '14 Paper Acceptance Rate64of355submissions,18%Overall Acceptance Rate498of2,863submissions,17%
More
Upcoming Conference
WSDM '25

Sponsor:

sigir

sigir

sigir

sigir

The Eighteenth ACM International Conference on Web Search and Data Mining

April 7 - 11, 2025

Hannover , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 22
  Total Citations
  View Citations
- 399
  Total Downloads
- Downloads (Last 12 months)15
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Trust, but verify: predicting contribution quality for knowledge base construction and curation

WSDM '14: Proceedings of the 7th ACM international conference on Web search and data mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Surface Name Errors in Wikipedia

LONLIES: Estimating Property Values for Long Tail Entities

HighLife: Higher-arity Fact Harvesting