Skip to main content

2023 | OriginalPaper | Buchkapitel

2. What is Data Science?

verfasst von : Orit Hazzan, Koby Mike

Erschienen in: Guide to Teaching Data Science

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Although many attempts have been made to define data science, such a definition has not yet been reached. One reason for the difficulty to reach a single, consensus definition for data science is its multifaceted nature: it can be described as a science, as a research method, as a discipline, as a workflow, or as a profession. One single definition just cannot capture this diverse essence of data science. In this chapter, we first take an interdisciplinary perspective and review the background for the development of data science (Sect. 2.1). Then we present data science from several perspectives: data science as a science (Sect. 2.2), data science as a research method (Sect. 2.3), data science as a discipline (Sect. 2.4), data science as a workflow (Sect. 2.5), and data science as a profession (Sect. 2.6). We conclude by highlighting three main characteristics of data science: interdisciplinarity, learner diversity, and its research-oriented nature (Sect. 2.7).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Earth image was originally posted to Flickr by DonkeyHotey at https://​flickr.​com/​photos/​47422005@N04/​5679642883. It was reviewed on 4 December 2020 by FlickreviewR 2 and was confirmed to be licensed under the terms of the cc-by-2.0.
 
Literatur
Zurück zum Zitat Al-Hashedi, K. G., & Magalingam, P. (2021). Financial fraud detection applying data mining techniques: A comprehensive review from 2009 to 2019. Computer Science Review, 40, 100402.CrossRef Al-Hashedi, K. G., & Magalingam, P. (2021). Financial fraud detection applying data mining techniques: A comprehensive review from 2009 to 2019. Computer Science Review, 40, 100402.CrossRef
Zurück zum Zitat Alvargonza, D. (2011). Multidisciplinarity interdisciplinarity transdisciplinarity and the sciences. International Studies in the Philosophy Science, 25(4), 387–403. Alvargonza, D. (2011). Multidisciplinarity interdisciplinarity transdisciplinarity and the sciences. International Studies in the Philosophy Science, 25(4), 387–403.
Zurück zum Zitat Berman, F. (co-chair), Rutenbar, R. (co-chair), Christensen, H., Davidson, S., Estrin, D., Franklin, M., Hailpern, B., Martonosi, M., Raghavan, P., Stodden, V., & Szalay, A. (2016). Realizing the potential of data science: Final report from the national science foundation computer and information science and engineering advisory committee data science working group. National Science Foundation Computer and Information Science and Engineering Advisory Committee Report, December 2016; https://www.nsf.gov/cise/ac-data-science-report/CISEACDataScienceReport1.19.17.pdf Berman, F. (co-chair), Rutenbar, R. (co-chair), Christensen, H., Davidson, S., Estrin, D., Franklin, M., Hailpern, B., Martonosi, M., Raghavan, P., Stodden, V., & Szalay, A. (2016). Realizing the potential of data science: Final report from the national science foundation computer and information science and engineering advisory committee data science working group. National Science Foundation Computer and Information Science and Engineering Advisory Committee Report, December 2016; https://​www.​nsf.​gov/​cise/​ac-data-science-report/​CISEACDataScienc​eReport1.​19.​17.​pdf
Zurück zum Zitat Berman, F., Rutenbar, R., Hailpern, B., Christensen, H., Davidson, S., Estrin, D., Franklin, M., Martonosi, M., Raghavan, P., Stodden, V., & Szalay, A. S. (2018). Realizing the potential of data science. Communications of the ACM, 61(4), 67–72. https://doi.org/10.1145/3188721CrossRef Berman, F., Rutenbar, R., Hailpern, B., Christensen, H., Davidson, S., Estrin, D., Franklin, M., Martonosi, M., Raghavan, P., Stodden, V., & Szalay, A. S. (2018). Realizing the potential of data science. Communications of the ACM, 61(4), 67–72. https://​doi.​org/​10.​1145/​3188721CrossRef
Zurück zum Zitat Cassel, B., & Topi, H. (2015). Strengthening data science education through collaboration: Workshop report 7-27-2016. Arlington, VA. Cassel, B., & Topi, H. (2015). Strengthening data science education through collaboration: Workshop report 7-27-2016. Arlington, VA.
Zurück zum Zitat Cleveland, W. S. (2001). Data science: An action plan for expanding the technical areas of the field of statistics. International Statistical Review, 69(1), 21–26.CrossRefMATH Cleveland, W. S. (2001). Data science: An action plan for expanding the technical areas of the field of statistics. International Statistical Review, 69(1), 21–26.CrossRefMATH
Zurück zum Zitat Cox, M., & Ellsworth, D. (1997). Managing big data for scientific visualization. ACM Siggraph, 97(1), 21–38. Cox, M., & Ellsworth, D. (1997). Managing big data for scientific visualization. ACM Siggraph, 97(1), 21–38.
Zurück zum Zitat Davenport, T. H., & Patil, D. (2012). Data scientist: The sexiest job of the 21st century. Harvard Business Review, 90(5), 70–76. Davenport, T. H., & Patil, D. (2012). Data scientist: The sexiest job of the 21st century. Harvard Business Review, 90(5), 70–76.
Zurück zum Zitat Donoho, D. (2017). 50 years of data science. Journal of Computational and Graphical Statistics, 26(4), 745–766.MathSciNetCrossRef Donoho, D. (2017). 50 years of data science. Journal of Computational and Graphical Statistics, 26(4), 745–766.MathSciNetCrossRef
Zurück zum Zitat Fayyad, U., Piatetsky-Shapiro, G., & Smyth, P. (1996). From data mining to knowledge discovery in databases. AI Magazine, 17(3), 37–37. Fayyad, U., Piatetsky-Shapiro, G., & Smyth, P. (1996). From data mining to knowledge discovery in databases. AI Magazine, 17(3), 37–37.
Zurück zum Zitat Grimmer, J., Roberts, M. E., & Stewart, B. M. (2021). Machine learning for social science: An agnostic approach. Annual Review of Political Science, 24, 395–419.CrossRef Grimmer, J., Roberts, M. E., & Stewart, B. M. (2021). Machine learning for social science: An agnostic approach. Annual Review of Political Science, 24, 395–419.CrossRef
Zurück zum Zitat Harris, H., Murphy, S., & Vaisman, M. (2013). Analyzing the analyzers: An introspective survey of data scientists and their work. O’Reilly Media, Inc. Harris, H., Murphy, S., & Vaisman, M. (2013). Analyzing the analyzers: An introspective survey of data scientists and their work. O’Reilly Media, Inc.
Zurück zum Zitat Hey, T., Tansley, S., Tolle, K., & Gray, J. (2009). The fourth paradigm: Data-intensive scientific discovery (vol. 1). Microsoft research Redmond. Hey, T., Tansley, S., Tolle, K., & Gray, J. (2009). The fourth paradigm: Data-intensive scientific discovery (vol. 1). Microsoft research Redmond.
Zurück zum Zitat Ishaq, A., Sadiq, S., Umer, M., Ullah, S., Mirjalili, S., Rupapara, V., & Nappi, M. (2021). Improving the prediction of heart failure patients’ survival using SMOTE and effective data mining techniques. IEEE Access, 9, 39707–39716.CrossRef Ishaq, A., Sadiq, S., Umer, M., Ullah, S., Mirjalili, S., Rupapara, V., & Nappi, M. (2021). Improving the prediction of heart failure patients’ survival using SMOTE and effective data mining techniques. IEEE Access, 9, 39707–39716.CrossRef
Zurück zum Zitat Lovell, M. C. (1983). Data mining. The Review of Economics and Statistics, 65(1), 1–12.CrossRef Lovell, M. C. (1983). Data mining. The Review of Economics and Statistics, 65(1), 1–12.CrossRef
Zurück zum Zitat Mohebbi, M., Vanderkam, D., Kodysh, J., Schonberger, R., Choi, H., & Kumar, S. (2011). Google correlate whitepaper. Mohebbi, M., Vanderkam, D., Kodysh, J., Schonberger, R., Choi, H., & Kumar, S. (2011). Google correlate whitepaper.
Zurück zum Zitat Naur, P. (1966). The science of datalogy. Communications of the ACM, 9(7), 485.CrossRef Naur, P. (1966). The science of datalogy. Communications of the ACM, 9(7), 485.CrossRef
Zurück zum Zitat Piatetsky-Shapiro, G. (1990). Knowledge discovery in real databases: A report on the IJCAI-89 workshop. AI Magazine, 11(4), 68–68. Piatetsky-Shapiro, G. (1990). Knowledge discovery in real databases: A report on the IJCAI-89 workshop. AI Magazine, 11(4), 68–68.
Zurück zum Zitat Piatetsky-Shapiro, G. (2000). Knowledge discovery in databases: 10 years after. Acm Sigkdd Explorations Newsletter, 1(2), 59–61.CrossRef Piatetsky-Shapiro, G. (2000). Knowledge discovery in databases: 10 years after. Acm Sigkdd Explorations Newsletter, 1(2), 59–61.CrossRef
Zurück zum Zitat Prebor, G. (2021). When feminism meets social networks. Library Hi Tech. Prebor, G. (2021). When feminism meets social networks. Library Hi Tech.
Zurück zum Zitat Provost, F., & Fawcett, T. (2013). Data science for business: What you need to know about data mining and data-analytic thinking. O’Reilly Media, Inc. Provost, F., & Fawcett, T. (2013). Data science for business: What you need to know about data mining and data-analytic thinking. O’Reilly Media, Inc.
Zurück zum Zitat Shearer, C. (2000). The CRISP-DM model: The new blueprint for data mining. Journal of Data Warehousing, 5(4), 13–22. Shearer, C. (2000). The CRISP-DM model: The new blueprint for data mining. Journal of Data Warehousing, 5(4), 13–22.
Zurück zum Zitat Su, Y.-S., & Wu, S.-Y. (2021). Applying data mining techniques to explore user behaviors and watching video patterns in converged IT environments. Journal of Ambient Intelligence and Humanized Computing, 1–8. Su, Y.-S., & Wu, S.-Y. (2021). Applying data mining techniques to explore user behaviors and watching video patterns in converged IT environments. Journal of Ambient Intelligence and Humanized Computing, 1–8.
Zurück zum Zitat Tukey, J. W. (1977). Exploratory data analysis. Addison-Wesley. Tukey, J. W. (1977). Exploratory data analysis. Addison-Wesley.
Zurück zum Zitat Vamathevan, J., Clark, D., Czodrowski, P., Dunham, I., Ferran, E., Lee, G., Li, B., Madabhushi, A., Shah, P., Spitzer, M., & Zhao, S. (2019). Applications of machine learning in drug discovery and development. Nature Reviews Drug Discovery, 18(6), 463–477.CrossRef Vamathevan, J., Clark, D., Czodrowski, P., Dunham, I., Ferran, E., Lee, G., Li, B., Madabhushi, A., Shah, P., Spitzer, M., & Zhao, S. (2019). Applications of machine learning in drug discovery and development. Nature Reviews Drug Discovery, 18(6), 463–477.CrossRef
Metadaten
Titel
What is Data Science?
verfasst von
Orit Hazzan
Koby Mike
Copyright-Jahr
2023
DOI
https://doi.org/10.1007/978-3-031-24758-3_2

Premium Partner