2018 | OriginalPaper | Chapter

1. Big Data in Computational Social Sciences and Humanities: An Introduction

Abstract

This chapter provides an overview of the current development of big data in the computational social sciences and humanities. It is composed of two parts. In the first part, we review works that draw on the three most frequently seen types of big data, namely geographic data, text corpus data, and social media data, to conduct social science research in a wide range of fields, including anthropology, economics, finance, geography, history, linguistics, political science, psychology, public health, and mass communications. The second part of the chapter provides a panoramic view of the development of big data in the computational social sciences and humanities, including recent trends and the challenges they raise. As for the trends, we review four representative cases of timely development: big data finance, big data in psychology, the spatial humanities, and cloud computing. As for the challenges, we present an overview of four that are associated with big data, namely the complexity of big data (or the ontology and epistemology of big data), big data search, big data simulation, and big data risk.

Footnotes
1
Computational social sciences, as the title of this book series demonstrates, require little explanation. The term computational humanities, however, is less familiar. Gerhard Heyer distinguishes digital humanities from computational humanities as follows: the former is the creation, dissemination, and use of digital repositories, and the latter is the computer-based analysis of digital repositories using advanced computational and algorithmic methods (Biemann et al. 2014). Alternatively, “[c]omputational humanities is an emerging field that bridges the sciences and humanities with the goal of creating accurate computer simulations of historical, social, cultural, and religious events” (Cruz-Neira 2003, p. 10). See Gavin (2014) for a demonstration of the above two descriptions of computational humanities.
 
2
For the related applications of GIS to the humanities, also see Chaps. 3, 4, and 14. In fact, these four chapters can together be read as part of the spatial humanities.
 
3
For a general understanding of citizen science, also known as crowd science, and its recent development, the interested reader is referred to Cooper (2017) and Franzoni and Sauermann (2014).
 
4
Richard Thaler is the 2017 Nobel Laureate in Economics.
 
5
There is a philosophical issue as to whether machines will evolve to have their own interpretations of the text and hence develop their own emotions which are different from those of general human beings under the governance of their own culture. More positively, would machines surpass humans by demonstrating the features of positive psychology, as advocated by Martin Seligman (Seligman 2004), more successfully than humans?
 
6
There are already quite a few good references giving a panoramic guide to this fast-growing field. The interested reader is referred to Liu (2015), Pozzi et al. (2016), and Cambria et al. (2017).
 
7
While there are only two chapters collected in this volume, the interested reader may find more useful references in Peterson (2016) and the excellent collections edited by Mitra and Xiang (2016). However, sentiment analysis may go further, beyond what the current literature delineates, and can be further incorporated into agent-based computational finance and give new impetus to behavioral finance (Chen and Venkatachalam 2017).
 
8
For example, for the complexity measure for sentiments, see Joshi et al. (2014); for the complexity measure for networks, see Morzy et al. (2017).
 
9
Interested readers are referred to Bauerlein (2008), Sunstein (2008), Ceron et al. (2016), Thompson (2016), Helbing et al. (2017), O’Neil (2017), and Stephens-Davidowitz and Pabon (2017).
 
10
The PTT Bulletin Board System is the largest terminal-based bulletin board system (BBS) based in Taiwan. For more information, see https://en.wikipedia.org/wiki/PTT_Bulletin_Board_System.
 
11
The conundrum has been well illustrated by the so-called adaptive markets hypothesis, which endowed the efficient markets hypothesis with a dynamic and evolutionary interpretation (Lo 2004). In the agent-based vein, the adaptive markets hypothesis has been further studied in the form of the market fraction hypothesis (Chen et al. 2010).
 
12
This project is carried out within a collaboration between the Kavli Foundation, the Institute for the Interdisciplinary Study of Decision Making at New York University (NYU), and the NYU Center for Urban Science and Progress. For more details, the interested reader is referred to Azmak et al. (2015).
 
13
The current use of big data in psychology is not exhausted by the survey presented in this chapter. The journal Psychological Methods has published a special issue on this frontier (Harlow and Oswald 2016). For other developments, the interested reader is also referred to Cheung and Jak (2016) and Jones (2016).
 
14
The representativeness heuristic is one of the heuristics that have been carefully studied by psychologists and behavioral economists to explain how human decisions or judgments are made under uncertainty (Kahneman and Tversky 1972).
 
15
The interested reader is welcome to visit its home page: http://apsti.nccu.edu.tw/.
 
16
For a general background of this fast-growing field, the interested reader is referred to Bodenhamer et al. (2010).
 
17
This feature can be termed the big data paradox, namely too big to be “small.”
 
18
In the development of the computational social sciences and humanities, the role of cyborgs is often ignored. For example, in social simulation or agent-based simulation, there is a clear distinction between human agents and software agents, but their possible hybridizations are left out. See Chen et al. (2018).
 
Literature
Azmak, O., Bayer, H., Caplin, A., Chun, M., Glimcher, P., Koonin, S., & Patrinos, A. (2015). Using big data to understand the human condition: The Kavli HUMAN project. Big Data, 3(3), 173–188.
Bauerlein, M. (2008). The dumbest generation: How the digital age stupefies young Americans and jeopardizes our future (or, don’t trust anyone under 30). London: Penguin.
Biemann, C., Crane, G. R., Fellbaum, C. D., & Mehler, A. (2014). Computational humanities-bridging the gap between computer science and digital humanities (Dagstuhl Seminar 14301). In Dagstuhl reports (Vol. 4, No. 7). Dagstuhl: Schloss Dagstuhl-Leibniz-Zentrum für Informatik.
Bodenhamer, D. J., Corrigan, J., & Harris, T. M. (Eds.). (2010). The spatial humanities: GIS and the future of humanities scholarship. Bloomington: Indiana University Press.
Cambria, E., Das, D., Bandyopadhyay, S., & Feraco, A. (Eds.). (2017). A practical guide to sentiment analysis (Vol. 5). Heidelberg: Springer.
Ceron, A., Curini, L., & Iacus, S. M. (2016). Politics and big data: Nowcasting and forecasting elections with social media. Didcot: Taylor & Francis.
Chen, S.-H. (2008). Financial applications: Stock markets. In B. Wang (Ed.), Wiley encyclopedia of computer science and engineering (pp. 1227–1244). Hoboken: Wiley.
Chen, S.-H. (2013). Reasoning-based artificial agents in agent-based computational economics. In K. Nakamatsu & L. Jain (Eds.), The handbook on reasoning-based intelligent systems (pp. 575–602). Singapore: World Scientific.
Chen, S.-H., & Venkatachalam, R. (2017). Agent-based modelling as a foundation for big data. Journal of Economic Methodology, 24(4), 362–383.
Chen, S. H., Kaboudan, M., & Du, Y. R. (2018). Computational economics in the era of natural computationalism. In S. H. Chen, M. Kaboudan, & Y. R. Du (Eds.), The Oxford handbook of computational economics and finance. New York: Oxford.
Chen, S.-H., Kampouridis, M., & Tsang, E. (2010). Microstructure dynamics and agent-based financial markets. In International workshop on multi-agent systems and agent-based simulation (pp. 121–135). Berlin: Springer.
Clark, A. E., Flèche, S., Layard, R., Powdthavee, N., & Ward, G. (2018). The origins of happiness: The science of well-being over the life course. Princeton: Princeton University Press.
Conover, M. D., Ferrara, E., Menczer, F., & Flammini, A. (2013). The digital evolution of Occupy Wall Street. PLoS One, 8(5), e64679.
Conroy, N. J., Rubin, V. L., & Chen, Y. (2015). Automatic deception detection: Methods for finding fake news. Proceedings of the Association for Information Science and Technology, 52(1), 1–4.
Cooper, C. (2017). Citizen science: How ordinary people are changing the face of discovery. London: Gerald Duckworth & Co.
Cruz-Neira, C. (2003). Computational humanities: The new challenge for VR. IEEE Computer Graphics and Applications, 23(3), 10–13.
Franzoni, C., & Sauermann, H. (2014). Crowd science: The organization of scientific research in open collaborative projects. Research Policy, 43(1), 1–20.
Harlow, L. L., & Oswald, F. L. (2016). Big data in psychology: Introduction to the special issue. Psychological Methods, 21(4), 447.
Jones, M. N. (Ed.). (2016). Big data in cognitive science. Hove: Psychology Press.
Joshi, A., Mishra, A., Senthamilselvan, N., & Bhattacharyya, P. (2014). Measuring sentiment annotation complexity of text. In Proceedings of the 52nd annual meeting of the Association for Computational Linguistics (volume 2: Short papers) (Vol. 2, pp. 36–41).
Kahneman, D., & Tversky, A. (1972). Subjective probability: A judgment of representativeness. Cognitive Psychology, 3(3), 430–454.
Kleiner, B., Stam, A., & Pekari, A. (2015). Big data for the social sciences (FORS Working Papers, 2015-2).
Lane, J., Stodden, V., Bender, S., & Nissenbaum, H. (Eds.). (2014). Privacy, big data, and the public good: Frameworks for engagement. Cambridge: Cambridge University Press.
Liu, B. (2015). Sentiment analysis: Mining opinions, sentiments, and emotions. Cambridge: Cambridge University Press.
Loader, B. D., Vromen, A., Xenos, M. A., Steel, H., & Burgum, S. (2015). Campus politics, student societies and social media. The Sociological Review, 63(4), 820–839.
Lo, A. W. (2004). The adaptive markets hypothesis: Market efficiency from an evolutionary perspective. Journal of Portfolio Management, 30, 15–29.
McCloskey, D. N. (1983). The rhetoric of economics. Journal of Economic Literature, 21(2), 481–517.
McCloskey, D. N. (1998). The rhetoric of economics. Madison: University of Wisconsin Press.
Mitra, G., & Xiang, Y. (2016). Handbook of sentiment analysis in finance. New York: Albury Books.
Morson, G. S., & Schapiro, M. (2017). Cents and sensibility: What economics can learn from the humanities. Princeton: Princeton University Press.
Morzy, M., Kajdanowicz, T., & Kazienko, P. (2017). On measuring the complexity of networks: Kolmogorov complexity versus entropy. Complexity, 2017, 3250301.
O’Neil, C. (2017). Weapons of math destruction: How big data increases inequality and threatens democracy. New York: Broadway Books.
Peters, B. (2012). The big data gold rush. New York: Forbes Magazine.
Peterson, R. L. (2016). Trading on sentiment: The power of minds over markets. Hoboken: Wiley.
Pinheiro, F. L., Santos, M. D., Santos, F. C., & Pacheco, J. M. (2014). Origin of peer influence in social networks. Physical Review Letters, 112(9), 098702.
Pozzi, F. A., Fersini, E., Messina, E., & Liu, B. (2016). Sentiment analysis in social networks. Burlington: Morgan Kaufmann.
Rossbach, S. (1983). Feng Shui, the Chinese art of placement. New York: EP Dutton, Inc.
Roy, D., & Zeckhauser, R. (2016). Literary light on decision’s dark corner. In R. Frantz, S. H. Chen, K. Dopfer, F. Heukelom, & S. Mousavi (Eds.), Routledge handbook of behavioral economics (pp. 230–249). Abingdon: Routledge.
Savage, M., & Burrows, R. (2007). The coming crisis of empirical sociology. Sociology, 41(5), 885–899.
Seligman, M. E. (2004). Authentic happiness: Using the new positive psychology to realize your potential for lasting fulfillment. New York: Simon and Schuster.
Shiller, R. J. (2017). Narrative economics. American Economic Review, 107(4), 967–1004.
Soja, E. (2001). In different spaces: Interpreting the spatial organization of societies. In Proceedings, 3rd international space syntax symposium (p. 1-s1).
Soros, G. (2013). Fallibility, reflexivity, and the human uncertainty principle. Journal of Economic Methodology, 20(4), 309–329.
Stephens-Davidowitz, S., & Pabon, A. (2017). Everybody lies: Big data, new data, and what the internet can tell us about who we really are. New York: HarperLuxe.
Sunstein, C. R. (2008). Neither Hayek nor Habermas. Public Choice, 134(1–2), 87–95.
Thompson, A. (2016). Journalists and Trump voters live in separate online bubbles, MIT analysis shows. New York: Vice News.
Vaillant, G. E. (2008). Aging well: Surprising guideposts to a happier life from the landmark study of adult development. Boston: Little, Brown.
Webster, R. (2012). Feng Shui for beginners: Successful living by design. Woodbury: Llewellyn Worldwide.
WHO-CBD. (2015). Connecting global priorities: biodiversity and human health: a state of knowledge review, p. 344.
Metadata
Title
Big Data in Computational Social Sciences and Humanities: An Introduction
Authors
Shu-Heng Chen
Tina Yu
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-95465-3_1
