Skip to main content
Erschienen in: Business & Information Systems Engineering 5/2014

01.10.2014 | Research Paper

Taming Uncertainty in Big Data

Evidence from Social Media in Urban Areas

verfasst von: Johannes Bendler, Sebastian Wagner, Dipl.-Vw. Tobias Brandt, Prof. Dr. Dirk Neumann

Erschienen in: Business & Information Systems Engineering | Ausgabe 5/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

While the classic definition of Big Data included the dimensions volume, velocity, and variety, a fourth dimension, veracity, has recently come to the attention of researchers and practitioners. The increasing amount of user-generated data associated with the rise of social media emphasizes the need for methods to deal with the uncertainty inherent to these data sources. In this paper we address one aspect of uncertainty by developing a new methodology to establish the reliability of user-generated data based upon causal links with recurring patterns. We associate a large data set of geo-tagged Twitter messages in San Francisco with points of interest, such as bars, restaurants, or museums, within the city. This model is validated by causal relationships between a point of interest and the amount of messages in its vicinity. We subsequently analyze the behavior of these messages over time using a jackknifing procedure to identify categories of points of interest that exhibit consistent patterns over time. Ultimately, we condense this analysis into an indicator that gives evidence on the certainty of a data set based on these causal relationships and recurring patterns in temporal and spatial dimensions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
Zurück zum Zitat Du Y, Fan J, Chen J (2011) Experimental analysis of user mobility pattern in mobile social networks. In: IEEE Wireless communications and networking conference (WCNC), pp 1086–1090 Du Y, Fan J, Chen J (2011) Experimental analysis of user mobility pattern in mobile social networks. In: IEEE Wireless communications and networking conference (WCNC), pp 1086–1090
Zurück zum Zitat Ferrari L, Rosi A, Mamei M, Zambonelli F (2011) Extracting urban patterns from location-based social networks. In: Proc of the 3rd ACM SIGSPATIAL international workshop on location-based social networks (LBSN ’11). ACM, New York, pp 9–16 Ferrari L, Rosi A, Mamei M, Zambonelli F (2011) Extracting urban patterns from location-based social networks. In: Proc of the 3rd ACM SIGSPATIAL international workshop on location-based social networks (LBSN ’11). ACM, New York, pp 9–16
Zurück zum Zitat Heinrich B, Kaiser M, Klier M (2007) How to measure data quality? A metric-based approach. In: Rivard S, Webster J (eds) Proc of the 28th international conference on information systems (ICIS). Queen’s University, Montreal Heinrich B, Kaiser M, Klier M (2007) How to measure data quality? A metric-based approach. In: Rivard S, Webster J (eds) Proc of the 28th international conference on information systems (ICIS). Queen’s University, Montreal
Zurück zum Zitat Hilbert M, López P (2011) The world’s technological capacity to store, communicate, and compute information. Science 332(6025):60–65 CrossRef Hilbert M, López P (2011) The world’s technological capacity to store, communicate, and compute information. Science 332(6025):60–65 CrossRef
Zurück zum Zitat Kraut RE, Rice RE, Ronald E, Cool C, Fish RS (1998) Varieties of social influence: the role of utility and norms in the sSuccess of a new communication medium. Organization Science 9(4):437–453 CrossRef Kraut RE, Rice RE, Ronald E, Cool C, Fish RS (1998) Varieties of social influence: the role of utility and norms in the sSuccess of a new communication medium. Organization Science 9(4):437–453 CrossRef
Zurück zum Zitat Lee R, Wakamiya S, Sumiya K (2011) Discovery of unusual regional social activities using geo-tagged microblogs. World Wide Web 14(4):321–349 CrossRef Lee R, Wakamiya S, Sumiya K (2011) Discovery of unusual regional social activities using geo-tagged microblogs. World Wide Web 14(4):321–349 CrossRef
Zurück zum Zitat Liu B, Fu Y, Yao Z, Xiong H (2013) Learning geographical preferences for point-of-interest recommendation. In: Proc of the 19th ACM SIGKDD international conference on knowledge discovery and data mining (KDD ’13). ACM, pp 1043–1051, New York CrossRef Liu B, Fu Y, Yao Z, Xiong H (2013) Learning geographical preferences for point-of-interest recommendation. In: Proc of the 19th ACM SIGKDD international conference on knowledge discovery and data mining (KDD ’13). ACM, pp 1043–1051, New York CrossRef
Zurück zum Zitat Otto B, Wende K, Schmidt A, Osl P (2007) Towards a framework for corporate data quality management. In: ACIS 2007 proc Otto B, Wende K, Schmidt A, Osl P (2007) Towards a framework for corporate data quality management. In: ACIS 2007 proc
Zurück zum Zitat Sargent RP, Shepard RM, Glantz SA (2004) Reduced incidence of admissions for myocardial infarction associated with public smoking ban: before and after study. British Medical Journal 328:977–980 CrossRef Sargent RP, Shepard RM, Glantz SA (2004) Reduced incidence of admissions for myocardial infarction associated with public smoking ban: before and after study. British Medical Journal 328:977–980 CrossRef
Zurück zum Zitat Tobler WR (1970) A computer movie simulating urban growth in the Detroit region. Economic Geography 46:234–240 CrossRef Tobler WR (1970) A computer movie simulating urban growth in the Detroit region. Economic Geography 46:234–240 CrossRef
Zurück zum Zitat Wakamiya S, Lee R, Sumiya K (2011) Crowd-based urban characterization: extracting crowd behavioral patterns in urban areas from Twitter. In: Proc of the 3rd ACM SIGSPATIAL international workshop on location-based social networks (LBSN ’11). ACM, New York, pp 77–84 Wakamiya S, Lee R, Sumiya K (2011) Crowd-based urban characterization: extracting crowd behavioral patterns in urban areas from Twitter. In: Proc of the 3rd ACM SIGSPATIAL international workshop on location-based social networks (LBSN ’11). ACM, New York, pp 77–84
Zurück zum Zitat Wasserkrug S, Gal A, Etzion O (2005) A model for reasoning with uncertain rules in event composition systems. In: Proc of the 21st conference in uncertainty in artificial intelligence, Edinburgh, Scotland, UAI ’05, July 26–29, 2005. AUAI Press, Corvallis, pp 599–608 Wasserkrug S, Gal A, Etzion O (2005) A model for reasoning with uncertain rules in event composition systems. In: Proc of the 21st conference in uncertainty in artificial intelligence, Edinburgh, Scotland, UAI ’05, July 26–29, 2005. AUAI Press, Corvallis, pp 599–608
Zurück zum Zitat Wasserkrug S, Gal A, Etzion O, Turchin Y (2008) Complex event processing over uncertain data. In: Proc of the second international conference on distributed event-based systems (DEBS ’08). ACM, New York, pp 253–264 CrossRef Wasserkrug S, Gal A, Etzion O, Turchin Y (2008) Complex event processing over uncertain data. In: Proc of the second international conference on distributed event-based systems (DEBS ’08). ACM, New York, pp 253–264 CrossRef
Zurück zum Zitat Zhang X, Zhu F (2011) Group size and incentives to contribute: a natural experiment at Chinese wikipedia. The American Economic Review 101(4):1601–1615 CrossRef Zhang X, Zhu F (2011) Group size and incentives to contribute: a natural experiment at Chinese wikipedia. The American Economic Review 101(4):1601–1615 CrossRef
Metadaten
Titel
Taming Uncertainty in Big Data
Evidence from Social Media in Urban Areas
verfasst von
Johannes Bendler
Sebastian Wagner
Dipl.-Vw. Tobias Brandt
Prof. Dr. Dirk Neumann
Publikationsdatum
01.10.2014
Verlag
Springer Fachmedien Wiesbaden
Erschienen in
Business & Information Systems Engineering / Ausgabe 5/2014
Print ISSN: 2363-7005
Elektronische ISSN: 1867-0202
DOI
https://doi.org/10.1007/s12599-014-0342-4

Weitere Artikel der Ausgabe 5/2014

Business & Information Systems Engineering 5/2014 Zur Ausgabe

Editorial

Big Data

Imprint

Imprint