Skip to main content
Erschienen in: AStA Wirtschafts- und Sozialstatistisches Archiv 2-3/2016

26.07.2016 | Originalveröffentlichung

The use of Twitter data to improve small area estimates of households’ share of food consumption expenditure in Italy

verfasst von: Stefano Marchetti, Caterina Giusti, Monica Pratesi

Erschienen in: AStA Wirtschafts- und Sozialstatistisches Archiv | Ausgabe 2-3/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The use of big data in many socio-economic studies has received a growing interest in the last few years. In this work we use emotional data coming from Twitter as auxiliary variable in a small area model to estimate Italian households’ share of food consumption expenditure (the proportion of food consumption expenditure on the total consumption expenditure) at provincial level. We show that the use of Twitter data has a potential in predicting our target variable. Moreover, the use of these data as auxiliary variable in the small area working model reduces the estimated mean squared error in comparison with what obtained by the same working model without the Twitter data.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
The equivalence scale is the Carbonaro scale used by ISTAT, according to which the expenditure of a family is divided by a specific coefficient depending on the household size (for example equal to 0.66 for a household with 1 member, 1.33 for a household with 3 members and up to 2.40 for a household with 7 members or more). In this way the expenditures of households of any size can be directly compared with those of households composed by two members.
 
2
This survey is a census survey, although some nonresponses can occur. Here we ignore the nonresponses and we use these data as census data.
 
3
Intended as households consisting of two or more individuals who are related by birth, marriage or adoption, although they also may include other unrelated people.
 
4
In other social studies content analysis of the texts posted using social network seems to provide acceptable predictions of the behavior of the whole population, not only of those using Twitter. Particularly, the debate is lively on the electoral predictions (Ceron et al 2015).
 
Literatur
Zurück zum Zitat Barigozzi M, Alessi L, Capasso M, Fagiolo G (2009) The distribution of households consumption-expenditure budget shares. Working papers series. European Central Bank, Frankfurt a.M. (Tech. rep.)MATH Barigozzi M, Alessi L, Capasso M, Fagiolo G (2009) The distribution of households consumption-expenditure budget shares. Working papers series. European Central Bank, Frankfurt a.M. (Tech. rep.)MATH
Zurück zum Zitat Blumenstock J, Cadamuro G, On R (2015) Predicting poverty and wealth from mobile phone metadata. Science 350:1073–1076CrossRef Blumenstock J, Cadamuro G, On R (2015) Predicting poverty and wealth from mobile phone metadata. Science 350:1073–1076CrossRef
Zurück zum Zitat Ceron A, Curini L, Iacus S (2015) Using social media to fore- cast electoral results: A review of state-of-the-art. Italian J Appl Stat 25(3):237–259 Ceron A, Curini L, Iacus S (2015) Using social media to fore- cast electoral results: A review of state-of-the-art. Italian J Appl Stat 25(3):237–259
Zurück zum Zitat Cordero C, Encina J, Lahiri P (2016) Analysis of poverty data by small area estimation. Wiley, Chichester (chap Poverty mapping for the Chilean comunas) Cordero C, Encina J, Lahiri P (2016) Analysis of poverty data by small area estimation. Wiley, Chichester (chap Poverty mapping for the Chilean comunas)
Zurück zum Zitat Curini L, Iacus S, Canova L (2015) Measuring idiosyncratic happiness through the analysis of twitter: An application to the italian case. Soc Indic Res 121(2):525–542CrossRef Curini L, Iacus S, Canova L (2015) Measuring idiosyncratic happiness through the analysis of twitter: An application to the italian case. Soc Indic Res 121(2):525–542CrossRef
Zurück zum Zitat Datta G, Lahiri P (2000) A unified measure of uncertainty of estimated best linear unbiased predictors in small area estimation problems. Stat Sin 10:613–627MathSciNetMATH Datta G, Lahiri P (2000) A unified measure of uncertainty of estimated best linear unbiased predictors in small area estimation problems. Stat Sin 10:613–627MathSciNetMATH
Zurück zum Zitat Datta G, Hall P, Mandal A (2011) Model selection and testing for the presence of small area effects, and application to area-level data. J Am Stat Assoc 106:362–374MathSciNetCrossRefMATH Datta G, Hall P, Mandal A (2011) Model selection and testing for the presence of small area effects, and application to area-level data. J Am Stat Assoc 106:362–374MathSciNetCrossRefMATH
Zurück zum Zitat Deaton A (2003) Household surveys, consumption, and the measurement of poverty. Econ Syst Res 15(2):135–159CrossRef Deaton A (2003) Household surveys, consumption, and the measurement of poverty. Econ Syst Res 15(2):135–159CrossRef
Zurück zum Zitat Decuyper A, Rutherford A, Wadhwa A, Bauer J, Krings G, Gutierrez T, Blondel V, Luengo-Oroz M (2014) Estimating food consumption and poverty indices with mobile phone data. United Nations Global Pulse, New York (Tech. rep.) Decuyper A, Rutherford A, Wadhwa A, Bauer J, Krings G, Gutierrez T, Blondel V, Luengo-Oroz M (2014) Estimating food consumption and poverty indices with mobile phone data. United Nations Global Pulse, New York (Tech. rep.)
Zurück zum Zitat Fay R, Herriot R (1979) Estimation of icome from small places: An application of James-Stein procedures to census data. J Am Stat Assoc 74:269–277MathSciNetCrossRef Fay R, Herriot R (1979) Estimation of icome from small places: An application of James-Stein procedures to census data. J Am Stat Assoc 74:269–277MathSciNetCrossRef
Zurück zum Zitat Giusti C, Marchetti S, Pratesi M, Salvati N (2012) Robust small area estimation and oversampling in the estimation of poverty indicators. Surv Res Methods 6(3):155–163 Giusti C, Marchetti S, Pratesi M, Salvati N (2012) Robust small area estimation and oversampling in the estimation of poverty indicators. Surv Res Methods 6(3):155–163
Zurück zum Zitat Giusti C, Masserini L, Pratesi M (2016) Local comparisons of small area estimates of poverty: an application within the tuscany region in italy. Soc Indic Res. doi:10.1007/s11205-015-1193-1 Giusti C, Masserini L, Pratesi M (2016) Local comparisons of small area estimates of poverty: an application within the tuscany region in italy. Soc Indic Res. doi:10.1007/s11205-015-1193-1
Zurück zum Zitat Hastie T, Pregibon D (1992) Generalized linear models. Wadsworth and Brooks/Cole, Edmonton (chap 6) Hastie T, Pregibon D (1992) Generalized linear models. Wadsworth and Brooks/Cole, Edmonton (chap 6)
Zurück zum Zitat Hidiroglou M, Singh A, Hamel M (2007) Some thoughts on small area estimation for the canadian community health survey. Statistics Canada (Internal statistics canada document) Hidiroglou M, Singh A, Hamel M (2007) Some thoughts on small area estimation for the canadian community health survey. Statistics Canada (Internal statistics canada document)
Zurück zum Zitat Horvitz D, Thompson D (1952) A generalization of sampling without replacement from a finite universe. J Am Stat Assoc 47:663–685MathSciNetCrossRefMATH Horvitz D, Thompson D (1952) A generalization of sampling without replacement from a finite universe. J Am Stat Assoc 47:663–685MathSciNetCrossRefMATH
Zurück zum Zitat ISTAT (2015) Statistiche report, la spesa per consumi delle famiglie, anno 2014. roma, 8 luglio 2015. ISTAT, Roma (Tech. rep.) ISTAT (2015) Statistiche report, la spesa per consumi delle famiglie, anno 2014. roma, 8 luglio 2015. ISTAT, Roma (Tech. rep.)
Zurück zum Zitat Jiang J, Lahiri P, Wan S, Wu C (2001) Jackknifing in the fay-herriot model with an example (unpublished manuscript) Jiang J, Lahiri P, Wan S, Wu C (2001) Jackknifing in the fay-herriot model with an example (unpublished manuscript)
Zurück zum Zitat Lechene V (2000) National Food Survey: 2000. Department for Environment Food and Rural Affairs, London (chap Income and price elasticities of demand for foods consumed in the home) Lechene V (2000) National Food Survey: 2000. Department for Environment Food and Rural Affairs, London (chap Income and price elasticities of demand for foods consumed in the home)
Zurück zum Zitat Marchetti S, Secondi L (2016) Estimates of household consumption expenditure at provincial level in italy by using small area estimation methods: Real comparisons using purchasing power parities. Soc Indic Res. doi:10.1007/s11205-016-1230-8 Marchetti S, Secondi L (2016) Estimates of household consumption expenditure at provincial level in italy by using small area estimation methods: Real comparisons using purchasing power parities. Soc Indic Res. doi:10.1007/s11205-016-1230-8
Zurück zum Zitat Marchetti S, Giusti C, Pratesi M, Salvati N, Giannotti F, Pedreschi D, Rinzivillo S, Pappalardo L, Gabrielli L (2015) Small area model-based estimators using big data sources. J Off Stat 31:263–281 Marchetti S, Giusti C, Pratesi M, Salvati N, Giannotti F, Pedreschi D, Rinzivillo S, Pappalardo L, Gabrielli L (2015) Small area model-based estimators using big data sources. J Off Stat 31:263–281
Zurück zum Zitat Meyer BD, James XS (2003) Measuring the well-being of the poor using income and consumption. J Hum Resour v38(Supplement):1180–1220CrossRef Meyer BD, James XS (2003) Measuring the well-being of the poor using income and consumption. J Hum Resour v38(Supplement):1180–1220CrossRef
Zurück zum Zitat Porter A, Holan S, Wikle C, Cressie N (2014) Spatial fay-herriot models for small area estimation with functional covariates. Spat Stat 10:27–42MathSciNetCrossRef Porter A, Holan S, Wikle C, Cressie N (2014) Spatial fay-herriot models for small area estimation with functional covariates. Spat Stat 10:27–42MathSciNetCrossRef
Zurück zum Zitat Pratesi M, Giusti C, Marchetti S (2012) Small area estimation of poverty indicators. In: Davino C, Fabbris L (eds) Survey data collection and integration. Springer, Berlin Heidelberg Pratesi M, Giusti C, Marchetti S (2012) Small area estimation of poverty indicators. In: Davino C, Fabbris L (eds) Survey data collection and integration. Springer, Berlin Heidelberg
Zurück zum Zitat Regmi A, Deepak M, Seale J, Bernstein J (2001) Changing structure of global food consumption and trade. USDA (United States Department of Agriculture) – Economic Research Service, Washington (chap Cross-country analysis of food consumption patterns) Regmi A, Deepak M, Seale J, Bernstein J (2001) Changing structure of global food consumption and trade. USDA (United States Department of Agriculture) – Economic Research Service, Washington (chap Cross-country analysis of food consumption patterns)
Zurück zum Zitat Salvati N, Giusti C, Pratesi M (2014) The use of spatial information for the estimation of poverty indicators at the small area level. Routledge, London, New York Salvati N, Giusti C, Pratesi M (2014) The use of spatial information for the estimation of poverty indicators at the small area level. Routledge, London, New York
Zurück zum Zitat Shapiro S, Wilk M (1965) An analysis of variance test for normality (complete samples). Biometrika 67:215–216MathSciNetMATH Shapiro S, Wilk M (1965) An analysis of variance test for normality (complete samples). Biometrika 67:215–216MathSciNetMATH
Zurück zum Zitat Tzavidis N, Marchetti S, Chambers R (2010) Robust estimation of small area means and quantiles. Aust New Zel J Stat 52(2):167–186MathSciNetCrossRefMATH Tzavidis N, Marchetti S, Chambers R (2010) Robust estimation of small area means and quantiles. Aust New Zel J Stat 52(2):167–186MathSciNetCrossRefMATH
Metadaten
Titel
The use of Twitter data to improve small area estimates of households’ share of food consumption expenditure in Italy
verfasst von
Stefano Marchetti
Caterina Giusti
Monica Pratesi
Publikationsdatum
26.07.2016
Verlag
Springer Berlin Heidelberg
Erschienen in
AStA Wirtschafts- und Sozialstatistisches Archiv / Ausgabe 2-3/2016
Print ISSN: 1863-8155
Elektronische ISSN: 1863-8163
DOI
https://doi.org/10.1007/s11943-016-0190-4

Weitere Artikel der Ausgabe 2-3/2016

AStA Wirtschafts- und Sozialstatistisches Archiv 2-3/2016 Zur Ausgabe

Originalveröffentlichung

Statistik, Data Science und Big Data

Originalveröffentlichung

Big Data in der Regionalstatistik