Skip to main content

2019 | OriginalPaper | Buchkapitel

Who Tweets in Italian? Demographic Characteristics of Twitter Users

verfasst von : Righi Alessandra, Mauro M. Gentile, Domenico M. Bianco

Erschienen in: New Statistical Developments in Data Science

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper we try for the first time to shed light on the use of Twitter by the Italian speaking users quantifying the total audience and some relevant characteristics: in particular, gender and location. The attempt is based on publicly available APIs data referring both to profile documents and tweets. Through real-time calculation is possible to infer the gender mainly using the name field of the users’ profile, while the geo-location is deduced using the location field and the geotagged tweets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
According to Alexa.
 
3
For further information see [22] and the references therein.
 
5
According to wikipedia, there are 64 million native Italian speakers in the EU and 85 million in the world when in Italy there are 61 million inhabitants. Regarding English, there are 360–400 million native speakers and 600–700 million people that speaks English as a second language.
 
7
The enterprises whose websites were scraped in the cited study, were the majority (64%) of the enterprises (with 10 employees and over) having a website, but only the half of these enterprises presented links to social media.
 
8
For example, consider a company named “rossi” and the username “alexRossi”. The username contains the company name but the remaining letters can be interpreted as a male proper name and hence the username is not labelled as a company.
 
9
i.e. the Italian National Institute of Statistics list of municipalities, containing 7978 Italian municipalities.
 
10
We tried also to determine the users profession using the bio field, through a list of roughly 1000 professions. Results were absolutely not satisfactory maybe because the bio field is an open field that each user interprets in her own way.
 
Literatur
3.
Zurück zum Zitat Burger, J.D., Henderson, J., Kim, G., Zarrella, G.: Discriminating gender on twitter. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11, pp. 1301–1309, Stroudsburg, PA, USA. Association for Computational Linguistics. ISBN 978-1-937284-11-4. http://dl.acm.org/citation.cfm?id=2145432.2145568 (2011) Burger, J.D., Henderson, J., Kim, G., Zarrella, G.: Discriminating gender on twitter. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11, pp. 1301–1309, Stroudsburg, PA, USA. Association for Computational Linguistics. ISBN 978-1-937284-11-4. http://​dl.​acm.​org/​citation.​cfm?​id=​2145432.​2145568 (2011)
5.
Zurück zum Zitat Chang, J., Rosenn, I., Backstrom, L., Marlow,C.: Epluribus: Ethnicity on social networks. In: ICWSM (2010) Chang, J., Rosenn, I., Backstrom, L., Marlow,C.: Epluribus: Ethnicity on social networks. In: ICWSM (2010)
6.
Zurück zum Zitat Cheng, Z., Caverlee, J., Lee, K.: You are where you tweet: a content-based approach to geo-locating twitter users. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, CIKM ’10, New York, NY, USA, pp. 759–768. ACM. ISBN 978-1-4503-0099-5. https://doi.org/10.1145/1871437.1871535 (2010) Cheng, Z., Caverlee, J., Lee, K.: You are where you tweet: a content-based approach to geo-locating twitter users. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, CIKM ’10, New York, NY, USA, pp. 759–768. ACM. ISBN 978-1-4503-0099-5. https://​doi.​org/​10.​1145/​1871437.​1871535 (2010)
7.
Zurück zum Zitat Chu, Z., Gianvecchio, S., Wang, H., Jajodia, S.: Who is tweeting on twitter: human, bot, or cyborg? In: Proceedings of the 26th Annual Computer Security Applications Conference, ACSAC ’10, New York, NY, USA, pp. 21–30. ACM. ISBN 978-1-4503-0133-6. https://doi.org/10.1145/1920261.1920265 (2010) Chu, Z., Gianvecchio, S., Wang, H., Jajodia, S.: Who is tweeting on twitter: human, bot, or cyborg? In: Proceedings of the 26th Annual Computer Security Applications Conference, ACSAC ’10, New York, NY, USA, pp. 21–30. ACM. ISBN 978-1-4503-0133-6. https://​doi.​org/​10.​1145/​1920261.​1920265 (2010)
9.
Zurück zum Zitat Daas, P.J., Burger, J., Le, Q., ten Bosch, O., Puts, M.J.: Profiling of Twitter Users: A Big Data Selectivity Study (2016) Daas, P.J., Burger, J., Le, Q., ten Bosch, O., Puts, M.J.: Profiling of Twitter Users: A Big Data Selectivity Study (2016)
11.
Zurück zum Zitat Gurajala, S., White, J.S., Hudson, B., Matthews, J.N.: Fake twitter accounts: profile characteristics obtained using an activity-based pattern detection approach. In: Proceedings of the 2015 International Conference on Social Media & Society, SMSociety ’15, New York, NY, USA, pp. 9:1–9:7. ACM. ISBN 978-1-4503-3923-0. https://doi.org/10.1145/2789187.2789206 (2015) Gurajala, S., White, J.S., Hudson, B., Matthews, J.N.: Fake twitter accounts: profile characteristics obtained using an activity-based pattern detection approach. In: Proceedings of the 2015 International Conference on Social Media & Society, SMSociety ’15, New York, NY, USA, pp. 9:1–9:7. ACM. ISBN 978-1-4503-3923-0. https://​doi.​org/​10.​1145/​2789187.​2789206 (2015)
12.
Zurück zum Zitat Huang, W., Weber, I., Vieweg, S.: Inferring nationalities of twitter users and studying inter-national linking. In: Proceedings of the 25th ACM Conference on Hypertext and Social Media, HT ’14, New York, NY, USA, pp. 237–242. ACM. ISBN 978-1-4503-2954-5. https://doi.org/10.1145/2631775.2631825 (2014) Huang, W., Weber, I., Vieweg, S.: Inferring nationalities of twitter users and studying inter-national linking. In: Proceedings of the 25th ACM Conference on Hypertext and Social Media, HT ’14, New York, NY, USA, pp. 237–242. ACM. ISBN 978-1-4503-2954-5. https://​doi.​org/​10.​1145/​2631775.​2631825 (2014)
14.
Zurück zum Zitat Ikeda, K., Hattori, G., Matsumoto, K., Ono, C., Higashino, T.: Demographic estimation of twitter users for marketing analysis. IPSJ Trans. Consum. Devices Syst. 2(1), 82–93 (2012) Ikeda, K., Hattori, G., Matsumoto, K., Ono, C., Higashino, T.: Demographic estimation of twitter users for marketing analysis. IPSJ Trans. Consum. Devices Syst. 2(1), 82–93 (2012)
17.
Zurück zum Zitat Lee, K., Caverlee, J., Webb, S.: Uncovering social spammers: Social honeypots + machine learning. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’10, pp. 435–442, New York, NY, USA. ACM (2010). ISBN 978-1-4503-0153-4. https://doi.org/10.1145/1835449.1835522 Lee, K., Caverlee, J., Webb, S.: Uncovering social spammers: Social honeypots + machine learning. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’10, pp. 435–442, New York, NY, USA. ACM (2010). ISBN 978-1-4503-0153-4. https://​doi.​org/​10.​1145/​1835449.​1835522
18.
Zurück zum Zitat Liu, W., Ruths, D.: What’s in a name? using first names as features for gender inference in twitter. In: AAAI spring symposium: Analyzing microtext, vol. 13, p. 01 (2013) Liu, W., Ruths, D.: What’s in a name? using first names as features for gender inference in twitter. In: AAAI spring symposium: Analyzing microtext, vol. 13, p. 01 (2013)
19.
Zurück zum Zitat Mislove, A., Jørgensen, S., Ahn, Y.-Y., Onnela, J.-P., Rosenquist, J.: Understanding the demographics of twitter users, pp. 554–557. AAAI Press (2011). ISBN 978-1-57735-505-2 Mislove, A., Jørgensen, S., Ahn, Y.-Y., Onnela, J.-P., Rosenquist, J.: Understanding the demographics of twitter users, pp. 554–557. AAAI Press (2011). ISBN 978-1-57735-505-2
20.
Zurück zum Zitat Mohammady, E., Culotta, A.: Using county demographics to infer attributes of twitter users. ACL 2014, 7 (2014) Mohammady, E., Culotta, A.: Using county demographics to infer attributes of twitter users. ACL 2014, 7 (2014)
21.
Zurück zum Zitat Nguyen, D., Smith, N.A., Rosé, C.P.: Author age prediction from text using linear regression. In: Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, LaTeCH ’11, pp. 115–123, Stroudsburg, PA, USA, 2011. Association for Computational Linguistics. ISBN 9781937284046. http://dl.acm.org/citation.cfm?id=2107636.2107651 Nguyen, D., Smith, N.A., Rosé, C.P.: Author age prediction from text using linear regression. In: Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, LaTeCH ’11, pp. 115–123, Stroudsburg, PA, USA, 2011. Association for Computational Linguistics. ISBN 9781937284046. http://​dl.​acm.​org/​citation.​cfm?​id=​2107636.​2107651
22.
Zurück zum Zitat Paquet-Clouston, M., Bilodeau, O., Décary-Hétu, D.: Can we trust social media data?: Social network manipulation by an iot botnet. In: Proceedings of the 8th International Conference on Social Media & Society, #SMSociety17, pp. 15:1–15:9, New York, NY, USA. ACM. ISBN 978-1-4503-4847-8. https://doi.org/10.1145/3097286.3097301 Paquet-Clouston, M., Bilodeau, O., Décary-Hétu, D.: Can we trust social media data?: Social network manipulation by an iot botnet. In: Proceedings of the 8th International Conference on Social Media & Society, #SMSociety17, pp. 15:1–15:9, New York, NY, USA. ACM. ISBN 978-1-4503-4847-8. https://​doi.​org/​10.​1145/​3097286.​3097301
23.
Zurück zum Zitat Pennacchiotti, M., Popescu, A.-M.: A machine learning approach to twitter user classification. In: ICWSM (2011) Pennacchiotti, M., Popescu, A.-M.: A machine learning approach to twitter user classification. In: ICWSM (2011)
25.
Zurück zum Zitat Rao, D., Yarowsky, D., Shreevats, A., Gupta, M.: Classifying latent user attributes in twitter. In: Proceedings of the 2Nd International Workshop on Search and Mining User-generated Contents, SMUC ’10, pp. 37–44, New York, NY, USA. ACM. ISBN 978-1-4503-0386-6. https://doi.org/10.1145/1871985.1871993 (2010) Rao, D., Yarowsky, D., Shreevats, A., Gupta, M.: Classifying latent user attributes in twitter. In: Proceedings of the 2Nd International Workshop on Search and Mining User-generated Contents, SMUC ’10, pp. 37–44, New York, NY, USA. ACM. ISBN 978-1-4503-0386-6. https://​doi.​org/​10.​1145/​1871985.​1871993 (2010)
27.
Zurück zum Zitat Sakaki, S., Miura, Y., Ma, X., Hattori, K., Ohkuma, T.: Twitter user gender inference using combined analysis of text and image processing. V&L Net 2014, 54 (2014) Sakaki, S., Miura, Y., Ma, X., Hattori, K., Ohkuma, T.: Twitter user gender inference using combined analysis of text and image processing. V&L Net 2014, 54 (2014)
28.
Zurück zum Zitat Schwartz, H.A., Eichstaedt, J.C., Kern, M.L., Dziurzynski, L., Lucas, R.E., Agrawal, M., Park, G.J., Lakshmikanth, S.K., Jha, S., Seligman, M.E. et al.: Characterizing geographic variation in well-being using tweets. In: ICWSM (2013) Schwartz, H.A., Eichstaedt, J.C., Kern, M.L., Dziurzynski, L., Lucas, R.E., Agrawal, M., Park, G.J., Lakshmikanth, S.K., Jha, S., Seligman, M.E. et al.: Characterizing geographic variation in well-being using tweets. In: ICWSM (2013)
Metadaten
Titel
Who Tweets in Italian? Demographic Characteristics of Twitter Users
verfasst von
Righi Alessandra
Mauro M. Gentile
Domenico M. Bianco
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-21158-5_25