Skip to main content

2017 | OriginalPaper | Buchkapitel

On Quality Assesement in Wikipedia Articles Based on Markov Random Fields

verfasst von : Rajmund Kleminski, Tomasz Kajdanowicz, Roman Bartusiak, Przemyslaw Kazienko

Erschienen in: Intelligent Information and Database Systems

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This article investigates the possibility of accurate quality prediction of resources generated by communities based on the crowd-generated content. We use data from Wikipedia, the prime example of community-run site, as our object of study. We define the quality as a distribution of user-assigned grades across a predefined range of possible scores and present a measure of distribution similarity to quantify the accuracy of a prediction. The proposed method of quality prediction is based on Markov Random Field and its Loopy Belief Propagation implementation. Based on our results, we highlight key problems in the approach as presented, as well as trade-offs caused by relying solely on network structure and characteristics, excluding metadata. The overall results of content quality prediction are promising in homophilic networks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat De la Calzada, G., Dekhtyar, A.: On measuring the quality of wikipedia articles. In: Proceedings of the 4th Workshop on Information Credibility, WICOW 2010, pp. 11–18. ACM (2010) De la Calzada, G., Dekhtyar, A.: On measuring the quality of wikipedia articles. In: Proceedings of the 4th Workshop on Information Credibility, WICOW 2010, pp. 11–18. ACM (2010)
2.
Zurück zum Zitat Dalip, D.H., Gonçalves, M.A., Cristo, M., Calado, P.: Automatic quality assessment of content created collaboratively by web communities: a case study of wikipedia. In: Proceedings of the 9th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL 2009, NY, USA, pp. 295–304. ACM, New York (2009) Dalip, D.H., Gonçalves, M.A., Cristo, M., Calado, P.: Automatic quality assessment of content created collaboratively by web communities: a case study of wikipedia. In: Proceedings of the 9th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL 2009, NY, USA, pp. 295–304. ACM, New York (2009)
3.
Zurück zum Zitat Hu, M., Lim, E.P., Sun, A., Lauw, H.W., Vuong, B.Q.: Measuring article quality in wikipedia: models and evaluation. In: Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management, CIKM 2007, pp. 243–252. ACM (2007) Hu, M., Lim, E.P., Sun, A., Lauw, H.W., Vuong, B.Q.: Measuring article quality in wikipedia: models and evaluation. In: Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management, CIKM 2007, pp. 243–252. ACM (2007)
4.
Zurück zum Zitat Kazienko, P., Kajdanowicz, T.: Label-dependent node classification in the network. Neurocomputing 75(1), 199–209 (2012)CrossRef Kazienko, P., Kajdanowicz, T.: Label-dependent node classification in the network. Neurocomputing 75(1), 199–209 (2012)CrossRef
5.
Zurück zum Zitat Liu, J., Ram, S.: Who does what: collaboration patterns in the wikipedia and their impact on article quality. ACM Trans. Manage. Inf. Syst. 2(2), 11:1–11:23 (2011)CrossRef Liu, J., Ram, S.: Who does what: collaboration patterns in the wikipedia and their impact on article quality. ACM Trans. Manage. Inf. Syst. 2(2), 11:1–11:23 (2011)CrossRef
6.
Zurück zum Zitat Malewicz, G., Austern, M.H., Bik, A.J., Dehnert, J.C., Horn, I., Leiser, N., Czajkowski, G.: Pregel: a system for large-scale graph processing. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, pp. 135–146. ACM (2010) Malewicz, G., Austern, M.H., Bik, A.J., Dehnert, J.C., Horn, I., Leiser, N., Czajkowski, G.: Pregel: a system for large-scale graph processing. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, pp. 135–146. ACM (2010)
7.
Zurück zum Zitat McPherson, M., Smith-Lovin, L., Cook, J.M.: Birds of a feather: homophily in social networks. Ann. Rev. Sociol. 27(1), 415–444 McPherson, M., Smith-Lovin, L., Cook, J.M.: Birds of a feather: homophily in social networks. Ann. Rev. Sociol. 27(1), 415–444
8.
Zurück zum Zitat Sen, P., Namata, G.M., Bilgic, M., Getoor, L., Gallagher, B., Eliassi-Rad, T.: Collective classification in network data. AI Mag. 29(3), 93–106 (2008) Sen, P., Namata, G.M., Bilgic, M., Getoor, L., Gallagher, B., Eliassi-Rad, T.: Collective classification in network data. AI Mag. 29(3), 93–106 (2008)
9.
Zurück zum Zitat Suzuki, Y., Yoshikawa, M.: Mutual evaluation of editors and texts for assessing quality of wikipedia articles. In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, WikiSym 2012, NY, USA, pp. 18:1–18:10. ACM, New York (2012) Suzuki, Y., Yoshikawa, M.: Mutual evaluation of editors and texts for assessing quality of wikipedia articles. In: Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, WikiSym 2012, NY, USA, pp. 18:1–18:10. ACM, New York (2012)
10.
Zurück zum Zitat Taskar, B., Abbeel, P., Koller, D.: Discriminative probabilistic models for relational data. In: Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence, UAI 2002, pp. 485–492. Morgan Kaufmann Publishers Inc., San Francisco (2002) Taskar, B., Abbeel, P., Koller, D.: Discriminative probabilistic models for relational data. In: Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence, UAI 2002, pp. 485–492. Morgan Kaufmann Publishers Inc., San Francisco (2002)
11.
Zurück zum Zitat Valiant, L.G.: A bridging model for parallel computation. Commun. ACM 33(8), 103–111 (1990)CrossRef Valiant, L.G.: A bridging model for parallel computation. Commun. ACM 33(8), 103–111 (1990)CrossRef
12.
Zurück zum Zitat Wohner, T., Peters, R.: Assessing the quality of wikipedia articles with lifecycle based metrics. In: Proceedings of the 5th International Symposium on Wikis and Open Collaboration, WikiSym 2009, NY, USA, pp. 16:1–16:10. ACM, New York (2009) Wohner, T., Peters, R.: Assessing the quality of wikipedia articles with lifecycle based metrics. In: Proceedings of the 5th International Symposium on Wikis and Open Collaboration, WikiSym 2009, NY, USA, pp. 16:1–16:10. ACM, New York (2009)
13.
Zurück zum Zitat Xin, R.S., Gonzalez, J.E., Franklin, M.J., Stoica, I.: Graphx: a resilient distributed graph system on spark. In: First International Workshop on Graph Data Management Experiences and Systems, p. 2. ACM (2013) Xin, R.S., Gonzalez, J.E., Franklin, M.J., Stoica, I.: Graphx: a resilient distributed graph system on spark. In: First International Workshop on Graph Data Management Experiences and Systems, p. 2. ACM (2013)
Metadaten
Titel
On Quality Assesement in Wikipedia Articles Based on Markov Random Fields
verfasst von
Rajmund Kleminski
Tomasz Kajdanowicz
Roman Bartusiak
Przemyslaw Kazienko
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-54472-4_73

Premium Partner