
2022 | OriginalPaper | Book chapter

2. First Step: Working with Survey Data

Written by: Walter R. Paczkowski

Published in: Modern Survey Analysis

Publisher: Springer International Publishing


Abstract

You cannot do basic survey data analysis or any type of data analysis, whether it be for surveys or not, without understanding the structure of your data. For surveys, this means at least understanding the background of your respondents: their gender, age, education, and so forth. This amounts to understanding respondents’ profiles. Examples include age distribution, gender distribution, income distribution, political party affiliation distribution, and residency distribution, to mention just a few. Profiles provide a perspective on how your respondents answer the main survey questions; different groups answer differently. But your data have to be organized to allow you to do this. In this chapter, you will gain a perspective on how to organize your data to prepare to look at the basic distributions of your respondents. You will then begin to look at your data in the next chapter.


Footnotes
1
The packages are sometimes referred to as libraries and modules. I will use packages.
 
2
Conda can be run from the Anaconda Navigator. See https://www.anaconda.com/.
 
3
There is a way around this, but I do not recommend omitting the package name. The reason is simple: there may be two or more functions with the same name.
 
4
Notice the percent sign before the command run and the use of double quotes. From the IPython documentation: “On Windows systems, the use of single quotes ' when specifying a file is not supported. Use double quotes ".”
 
5
I have not done an exhaustive check of all available statistical software packages, but based on my experience with a large number of them, especially all the major ones, I believe this statement is correct.
 
6
The word “dictionary” will be used several times in the next discussions, so it will be an overworked word. The correct usage has to be inferred from context.
 
7
These descriptions follow Paczkowski (2016). Used with permission of SAS.
 
8
The integer is an 8-bit integer, or 1 byte. This means that this is the simplest representation of the categories, which has definite memory-saving implications.
 
9
The order when sort = False is used is based on a hash table and so is arbitrary. See the Stack Overflow discussion at https://stackoverflow.com/questions/33661295/pandas-value-countssort-false-with-large-series-doesnt-work.
 
10
“Other” is defined as Public Health Service, Environmental Services Administration, National Oceanic and Atmospheric Administration, and US Merchant Marine.
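
Footnotes 3, 8, and 9 can be illustrated with a minimal sketch in pandas. The response values and variable names below are hypothetical and are not taken from the chapter; the sketch only shows the package-name prefix, the integer codes behind a categorical variable, and the sort option of value_counts.

```python
import pandas as pd

# Hypothetical survey responses (illustrative only, not the chapter's data).
# Footnote 3: the function is called with its package alias (pd.Series),
# so there is no ambiguity about which package it comes from.
responses = pd.Series(["Agree", "Disagree", "Agree", "Neutral", "Agree"])

# Footnote 8: a categorical variable stores each category as a small
# integer code; with only a few categories pandas uses 8-bit integers.
as_category = responses.astype("category")
codes_dtype = as_category.cat.codes.dtype

# Footnote 9: sort=False skips the ordering by frequency, so the order of
# the result should not be relied upon.
unsorted_counts = responses.value_counts(sort=False)

# Default behavior: counts sorted in descending order.
sorted_counts = responses.value_counts()

print(codes_dtype)
print(sorted_counts)
```

The counts themselves are the same with either setting of sort; only the order of the rows differs.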
 
References
Agresti, A. 2002. Categorical Data Analysis. 2nd ed. New York: Wiley.
Bachman, J.G., and P.M. O’Malley. 1984. Yea-Saying, Nay-Saying, and Going to Extremes: Black-White Differences in Response Styles. The Public Opinion Quarterly 48 (2): 491–509.
Bethlehem, J.G. 2002. Weighting Nonresponse Adjustments Based on Auxiliary Information. In Survey Nonresponse, 275–287. New York: Wiley.
Deming, W.E. 1943. Statistical Adjustment of Data. New York: Dover Publications, Inc.
Deming, W.E., and F.F. Stephan. 1940. On a least squares adjustment of a sampled frequency table when the expected marginal totals are known. The Annals of Mathematical Statistics 11 (4): 427–444.
Dorofeev, S., and P. Grant. 2006. Statistics for Real-Life Sample Surveys. Cambridge: Cambridge University Press.
Enders, C.K. 2010. Applied Missing Data Analysis. New York: The Guilford Press.
Fischer, R. 2004. Standardization to account for cross-cultural response bias: A classification of score adjustment procedures and review of research in JCCP. Journal of Cross-cultural Psychology 35 (3): 263–282.
Gelman, A., and J.B. Carlin. 2002. Poststratification and Weighting Adjustments. In Survey Nonresponse, 289–302. New York: Wiley.
Groves, R.M., D.A. Dillman, J.L. Eltinge, and R.J.A. Little, eds. 2002. Survey Nonresponse. New York: Wiley.
Hicks, L.E. 1970. Some properties of ipsative, normative, and forced-choice normative measures. Psychological Bulletin 74 (3): 167–184.
Hunt, J. 2020. A Beginners Guide to Python3 Programming. Berlin: Springer.
Liu, M. 2015. Response Style and Rating Scales: The Effects of Data Collection Mode, Scale Format, and Acculturation. PhD thesis, Michigan: The University of Michigan.
McKinney, W. 2018. Python for Data Analysis: Data Wrangling with Pandas, Numpy, and ipython, 2nd ed. Newton: O’Reilly.
Paczkowski, W.R. 2016. Market Data Analysis Using JMP. New York: SAS Press.
Paczkowski, W.R. 2018. Pricing Analytics: Models and Advanced Quantitative Techniques for Product Pricing. Milton Park: Routledge.
Paczkowski, W.R. 2020. Deep Data Analytics for New Product Development. Milton Park: Routledge.
Paczkowski, W.R. 2022. Business Analytics: Data Science for Business Problems. Berlin: Springer.
Pagolu, M.K., and G. Chakraborty. 2011. Eliminating response style segments in survey data via double standardization before clustering. In SAS Global Forum 2011. Data Mining and Text Analytics; Paper 165-2011.
Potter, F., and Y. Zheng. 2015. Methods and issues in trimming extreme weights in sample surveys. In Proceedings of the Survey Research Methods Section, 2707–2719. New York: American Statistical Association.
Safir, A. 2008. Check all that apply. In Encyclopedia of Survey Research Methods, ed. Paul J. Lavrakas, 95. New York: SAGE Publications, Inc.
Sedgewick, R., K. Wayne, and R. Dondero. 2016. Introduction to Python Programming: An Interdisciplinary Approach. London: Pearson.
Voss, D.S., A. Gelman, and G. King. 1995. Preelection survey methodology: Details from eight polling organizations, 1988 and 1992. The Public Opinion Quarterly 59 (1): 98–132.
Metadata
Title
First Step: Working with Survey Data
Written by
Walter R. Paczkowski
Copyright year
2022
DOI
https://doi.org/10.1007/978-3-030-76267-4_2