
2016 | Original Paper | Book Chapter

Retrieving Rising Stars in Focused Community Question-Answering

Authors: Long T. Le, Chirag Shah

Published in: Intelligent Information and Database Systems

Publisher: Springer Berlin Heidelberg


Abstract

In Community Question Answering (CQA) forums, a small fraction of users typically provide high-quality posts and earn a very high reputation in the community. These top contributors are critical to the community because they drive the development of the site and attract traffic from Internet users. Identifying these individuals could be highly valuable, but it is not an easy task. Unlike publication or social networks, most CQA sites lack information about peers, friends, or collaborators, which can be an important indicator of future success or performance. In this paper, we attempt this analysis by extracting different sets of features to predict future contribution. The experiment covers 376,000 users who remained active on Stack Overflow for at least one year and together contributed more than 21 million posts. One highlight of our approach is that it can identify rising stars after short observations. It achieves an accuracy of 85% when predicting, after a few weeks of observation, whether a user will become a top contributor. In a slightly different setting, where only a few posts by a user are observed, our method achieves accuracy higher than 90%. Our approach is more accurate than baseline methods, including a popular time series analysis. Furthermore, our methods are robust to the choice of classification algorithm. Identifying rising stars early could help CQA administrators gain an overview of the site’s future and ensure that potential contributors receive enough incentive and support.
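The prediction task described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption rather than something the abstract states: the two features (normalized post count and normalized mean post score during a short observation window), the toy data, and the use of logistic regression as the classifier are all hypothetical and chosen only to show the shape of a binary "future top contributor" prediction.

```python
import math

# Hypothetical toy data (invented for illustration, not from the paper).
# Each row: (normalized post count, normalized mean post score) observed
# during a short early window. Label 1 = later became a top contributor.
X = [(0.1, 0.2), (0.2, 0.1), (0.3, 0.3), (0.2, 0.4),
     (0.8, 0.9), (0.9, 0.7), (0.7, 0.8), (1.0, 0.6)]
y = [0, 0, 0, 0, 1, 1, 1, 1]


def sigmoid(z):
    """Logistic function mapping a real score to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def score(w, b, x):
    """Linear score w.x + b for one feature vector."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def train_logistic(X, y, lr=0.5, epochs=2000):
    """Fit logistic regression with plain batch gradient descent."""
    n = len(X)
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * len(w)
        grad_b = 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(score(w, b, xi)) - yi  # gradient of the log loss
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(w, b, x):
    """Return 1 if the predicted probability is at least 0.5, else 0."""
    return 1 if sigmoid(score(w, b, x)) >= 0.5 else 0


def accuracy(w, b, X, y):
    """Fraction of examples whose predicted label matches the true label."""
    return sum(predict(w, b, xi) == yi for xi, yi in zip(X, y)) / len(y)


w, b = train_logistic(X, y)
print("training accuracy:", accuracy(w, b, X, y))
print("very active new user:", predict(w, b, (0.9, 0.9)))
print("barely active new user:", predict(w, b, (0.1, 0.1)))
```

Because the abstract reports that the results are robust to the choice of classifier, any standard binary classifier could stand in for the logistic regression used in this sketch.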


Metadata
Title
Retrieving Rising Stars in Focused Community Question-Answering
Authors
Long T. Le
Chirag Shah
Copyright Year
2016
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-49390-8_3