Skip to main content

2018 | OriginalPaper | Buchkapitel

Human vs. Automatic Annotation Regarding the Task of Relevance Detection in Social Networks

verfasst von : Nuno Guimarães, Filipe Miranda, Álvaro Figueira

Erschienen in: Advances in Internet, Data & Web Technologies

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The burst of social networks and the possibility of being continuously connected has provided a fast way for information diffusion. More specifically, real-time posting allowed news and events to be reported quicker through social networks than traditional news media. However, the massive data that is daily available makes newsworthy information a needle in a haystack. Therefore, our goal is to build models that can detect journalistic relevance automatically in social networks. In order to do it, it is essential to establish a ground truth with a large number of entries that can provide a suitable basis for the learning algorithms due to the difficulty inherent to the ambiguity and wide scope associated with the concept of relevance. In this paper, we propose and compare two different methodologies to annotate posts regarding their relevance: automatic and human annotation. Preliminary results show that supervised models trained with the automatic annotation methodology tend to perform better than using human annotation in a test dataset labeled by experts.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Achrekar, H., Gandhe, A., Lazarus, R., Yu, S.-H., Liu, B.: Predicting flu trends using Twitter data. In: 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), pp. 702–707, April 2011 Achrekar, H., Gandhe, A., Lazarus, R., Yu, S.-H., Liu, B.: Predicting flu trends using Twitter data. In: 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), pp. 702–707, April 2011
3.
Zurück zum Zitat Aramaki, E., Maskawa, S., Morita, M.: Twitter catches the flu: detecting influenza epidemics using Twitter. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, pp. 1568–1576, Stroudsburg, PA, USA. Association for Computational Linguistics (2011) Aramaki, E., Maskawa, S., Morita, M.: Twitter catches the flu: detecting influenza epidemics using Twitter. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, pp. 1568–1576, Stroudsburg, PA, USA. Association for Computational Linguistics (2011)
4.
Zurück zum Zitat Doan, S., Vo, B.-K.H., Collier, N.: An analysis of Twitter messages in the 2011 Tohoku earthquake, pp. 58–66. Springer, Heidelberg (2012) Doan, S., Vo, B.-K.H., Collier, N.: An analysis of Twitter messages in the 2011 Tohoku earthquake, pp. 58–66. Springer, Heidelberg (2012)
5.
Zurück zum Zitat Figueira, A., Guimaraes, N.: Detecting journalistic relevance on social media, a two-case study using automatic surrogate features. In: Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (2017) Figueira, A., Guimaraes, N.: Detecting journalistic relevance on social media, a two-case study using automatic surrogate features. In: Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (2017)
6.
Zurück zum Zitat Lampos, V., De Bie, T., Cristianini, N.: Flu detector - tracking epidemics on Twitter, pp. 599–602. Springer, Heidelberg (2010) Lampos, V., De Bie, T., Cristianini, N.: Flu detector - tracking epidemics on Twitter, pp. 599–602. Springer, Heidelberg (2010)
7.
Zurück zum Zitat Lee, K., Agrawal, A., Choudhary, A.: Real-time disease surveillance using Twitter data: demonstration on flu and cancer. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2013, pp. 1474–1477. ACM, New York (2013) Lee, K., Agrawal, A., Choudhary, A.: Real-time disease surveillance using Twitter data: demonstration on flu and cancer. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2013, pp. 1474–1477. ACM, New York (2013)
8.
Zurück zum Zitat Mendoza, M., Poblete, B., Castillo, C.: Twitter under crisis: can we trust what we RT? In: Proceedings of the First Workshop on Social Media Analytics, SOMA 2010, pp. 71–79. ACM, New York (2010) Mendoza, M., Poblete, B., Castillo, C.: Twitter under crisis: can we trust what we RT? In: Proceedings of the First Workshop on Social Media Analytics, SOMA 2010, pp. 71–79. ACM, New York (2010)
9.
Zurück zum Zitat Muralidharan, S., Rasmussen, L., Patterson, D., Shin, J.-H.: Hope for Haiti: an analysis of facebook and Twitter usage during the earthquake relief efforts. Publ. Relat. Rev. 37(2), 175–177 (2011)CrossRef Muralidharan, S., Rasmussen, L., Patterson, D., Shin, J.-H.: Hope for Haiti: an analysis of facebook and Twitter usage during the earthquake relief efforts. Publ. Relat. Rev. 37(2), 175–177 (2011)CrossRef
11.
Zurück zum Zitat Ribeiro, F.N., Araújo, M., Gonçalves, P., Gonçalves, M.A., Benevenuto, F.: Sentibench - a benchmark comparison of state-of-the-practice sentiment analysis methods. EPJ Data Sci. 5(1), 23 (2016)CrossRef Ribeiro, F.N., Araújo, M., Gonçalves, P., Gonçalves, M.A., Benevenuto, F.: Sentibench - a benchmark comparison of state-of-the-practice sentiment analysis methods. EPJ Data Sci. 5(1), 23 (2016)CrossRef
12.
Zurück zum Zitat Robinson, B., Power, R., Cameron, M.: A sensitive Twitter earthquake detector. In: Proceedings of the 22nd International Conference on World Wide Web, WWW 2013 Companion, pp. 999–1002. ACM, New York (2013) Robinson, B., Power, R., Cameron, M.: A sensitive Twitter earthquake detector. In: Proceedings of the 22nd International Conference on World Wide Web, WWW 2013 Companion, pp. 999–1002. ACM, New York (2013)
13.
Zurück zum Zitat Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes Twitter users: real-time event detection by social sensors. In: Proceedings of the 19th International Conference on World Wide Web, WWW 2010, pp. 851–860. ACM, New York (2010) Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes Twitter users: real-time event detection by social sensors. In: Proceedings of the 19th International Conference on World Wide Web, WWW 2010, pp. 851–860. ACM, New York (2010)
14.
Zurück zum Zitat Sandim, M., Fortuna, P., Figueira, A., Oliveira, L.: Journalistic relevance classification in social network messages: an exploratory approach, pp. 631–642. Springer, Cham (2017) Sandim, M., Fortuna, P., Figueira, A., Oliveira, L.: Journalistic relevance classification in social network messages: an exploratory approach, pp. 631–642. Springer, Cham (2017)
15.
Zurück zum Zitat Sankaranarayanan, J., Samet, H., Teitler, B.E., Lieberman, M.D., Sperling, J.: Twitterstand: news in tweets. In: Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, GIS 2009, pp. 42–51. ACM, New York (2009) Sankaranarayanan, J., Samet, H., Teitler, B.E., Lieberman, M.D., Sperling, J.: Twitterstand: news in tweets. In: Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, GIS 2009, pp. 42–51. ACM, New York (2009)
17.
Zurück zum Zitat Yasseri, T., Spoerri, A., Graham, M., Kertész, J.: The most controversial topics in Wikipedia: a multilingual and geographical analysis. CoRR, abs/1305.5566 (2013) Yasseri, T., Spoerri, A., Graham, M., Kertész, J.: The most controversial topics in Wikipedia: a multilingual and geographical analysis. CoRR, abs/1305.5566 (2013)
18.
Zurück zum Zitat Figueira, Á., Oliveira, L.: The current state of fake news: challenges and opportunities. Procedia Comput. Sci. 121, 817–825 (2017). CENTERIS 2017 - International Conference on ENTERprise Information Systems/ProjMAN 2017 - International Conference on Project MANagement/HCist 2017 - International Conference on Health and Social Care Information Systems and Technologies, CENTERIS/ProjMAN/HCist (2017) Figueira, Á., Oliveira, L.: The current state of fake news: challenges and opportunities. Procedia Comput. Sci. 121, 817–825 (2017). CENTERIS 2017 - International Conference on ENTERprise Information Systems/ProjMAN 2017 - International Conference on Project MANagement/HCist 2017 - International Conference on Health and Social Care Information Systems and Technologies, CENTERIS/ProjMAN/HCist (2017)
Metadaten
Titel
Human vs. Automatic Annotation Regarding the Task of Relevance Detection in Social Networks
verfasst von
Nuno Guimarães
Filipe Miranda
Álvaro Figueira
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-75928-9_85

Premium Partner