Skip to main content
Top
Published in: International Journal on Software Tools for Technology Transfer 3/2015

01-06-2015 | Regular Paper

Workflows for quantitative data analysis in the social sciences

Authors: Kenneth J. Turner, Paul S. Lambert

Published in: International Journal on Software Tools for Technology Transfer | Issue 3/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The background is given as to how statistical analysis is used by quantitative social scientists. Developing statistical analyses requires substantial effort, yet there are important limitations in current practice. This has motivated the authors to create a more systematic and effective methodology with supporting tools. The approach to modelling quantitative data analysis in the social sciences is presented. Analysis scripts are treated abstractly as mathematical functions and concretely as web services. This allows individual scripts to be combined into high-level workflows. A comprehensive set of tools allows workflows to be defined, automatically validated and verified, and automatically implemented. The workflows expose opportunities for parallel execution, can define support for proper fault handling, and can be realised by non-technical users. Services, workflows and datasets can also be readily shared. The approach is illustrated with a realistic case study that analyses occupational position in relation to health.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Arkin, A., Askary, S., Bloch, B., Curbera, F., Goland, Y., Kartha, N., Lie, C.K., Thatte, S., Yendluri, P., Yiu, A. (eds.) Web Services Business Process Execution Language. Version 2.0. Organization for The Advancement of Structured Information Standards, Billerica (2007) Arkin, A., Askary, S., Bloch, B., Curbera, F., Goland, Y., Kartha, N., Lie, C.K., Thatte, S., Yendluri, P., Yiu, A. (eds.) Web Services Business Process Execution Language. Version 2.0. Organization for The Advancement of Structured Information Standards, Billerica (2007)
2.
go back to reference Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R., Konwinski, A., Lee, G., Paterson, D., Rabkin, A., Stoica, I., Zaharaia, M.: A view of cloud computing. Commun. ACM 53(4), 50–58 (Apr. 2010) Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R., Konwinski, A., Lee, G., Paterson, D., Rabkin, A., Stoica, I., Zaharaia, M.: A view of cloud computing. Commun. ACM 53(4), 50–58 (Apr. 2010)
3.
go back to reference Bethlehem, J.: Surveys without questions. In: De Leeuw, E., Hox, J., Dillman, D.A. (eds.) International Handbook of Survey Methodology, chap. 26, pp. 500–511. Psychology Press, London (2008) Bethlehem, J.: Surveys without questions. In: De Leeuw, E., Hox, J., Dillman, D.A. (eds.) International Handbook of Survey Methodology, chap. 26, pp. 500–511. Psychology Press, London (2008)
4.
go back to reference Blank, G., Rasmussen, K.B.: The Data Documentation Initiative: the value and significance of a worldwide standard. Soc. Sci. Comput. Rev. 22(3) (2004) Blank, G., Rasmussen, K.B.: The Data Documentation Initiative: the value and significance of a worldwide standard. Soc. Sci. Comput. Rev. 22(3) (2004)
5.
go back to reference Bradfield, J., Stirling, C.: Modal mu-calculi. In: Blackburn, P., van Benthem, J., Wolter, F. (eds.) Handbook of Modal Logic. Elsevier Science Publishers, Amsterdam (2007) Bradfield, J., Stirling, C.: Modal mu-calculi. In: Blackburn, P., van Benthem, J., Wolter, F. (eds.) Handbook of Modal Logic. Elsevier Science Publishers, Amsterdam (2007)
6.
go back to reference Browne, W.J., Cameron, B., Charlton, C.M.J., Michaelides, D.T., Parker, R.M.A., Szmaragd, C., Yang, H., Zhang, Z.: A beginner’s guide to Stat-JR. University of Bristol, Centre for Multilevel Modelling, Bristol (2012) Browne, W.J., Cameron, B., Charlton, C.M.J., Michaelides, D.T., Parker, R.M.A., Szmaragd, C., Yang, H., Zhang, Z.: A beginner’s guide to Stat-JR. University of Bristol, Centre for Multilevel Modelling, Bristol (2012)
7.
go back to reference Butler, M., Ferreira, C., Ng, M.Y.: Specifying and verifying web transactions. Univ. Comput. Sci. 11(5), 712–743 (2005) Butler, M., Ferreira, C., Ng, M.Y.: Specifying and verifying web transactions. Univ. Comput. Sci. 11(5), 712–743 (2005)
8.
go back to reference Chirichiello, A., Salaün, G., Encoding abstract descriptions into executable web services: towards a formal development. In: Proc. Web Intelligence. Institution of Electrical and Electronic Engineers Press, New York (2005) Chirichiello, A., Salaün, G., Encoding abstract descriptions into executable web services: towards a formal development. In: Proc. Web Intelligence. Institution of Electrical and Electronic Engineers Press, New York (2005)
9.
go back to reference de Roure, D.C., Goble, C.A., Stevens, R.: PX: a system extracting programs from proofs. In: Fox, G., Chiu, K., Buyya, R. (eds.) Proc. Int. Conf. on 3rd e-Science and Grid Computing, pp. 603–610. Institution of Electrical and Electronic Engineers Press, New York (2007) de Roure, D.C., Goble, C.A., Stevens, R.: PX: a system extracting programs from proofs. In: Fox, G., Chiu, K., Buyya, R. (eds.) Proc. Int. Conf. on 3rd e-Science and Grid Computing, pp. 603–610. Institution of Electrical and Electronic Engineers Press, New York (2007)
11.
go back to reference Ferrara, A.: Web services: a process algebra approach. In: Proc. 2nd International Conference on Service-Oriented Computing, pp. 242–251. ACM Press, New York (2004) Ferrara, A.: Web services: a process algebra approach. In: Proc. 2nd International Conference on Service-Oriented Computing, pp. 242–251. ACM Press, New York (2004)
12.
go back to reference Foster, H.: A rigorous approach to engineering web service compositions. Ph.D. thesis, Imperial College, London (2006) Foster, H.: A rigorous approach to engineering web service compositions. Ph.D. thesis, Imperial College, London (2006)
13.
go back to reference Foster, I.: What is the grid? A three point checklist. Grid Today 1(6) (2002) Foster, I.: What is the grid? A three point checklist. Grid Today 1(6) (2002)
14.
go back to reference Freese, J.: Replication standards for quantitative social science: why not sociology? Sociol. Methods Res. 36(2), 153–171 (2007)CrossRefMathSciNet Freese, J.: Replication standards for quantitative social science: why not sociology? Sociol. Methods Res. 36(2), 153–171 (2007)CrossRefMathSciNet
15.
go back to reference Fu, X., Bultan, T., Su, J.: Analysis of interacting BPEL web services. In: Proc. 13th. International World Wide Web Conference, pp. 621–630. ACM Press, New York (2004) Fu, X., Bultan, T., Su, J.: Analysis of interacting BPEL web services. In: Proc. 13th. International World Wide Web Conference, pp. 621–630. ACM Press, New York (2004)
16.
go back to reference Ghanem, M., Guo, Y., Rowe, A., Wendel, P.: Grid-based knowledge discovery services for high throughput informatics. In: Proc. 11th Int. Symp. on High Performance Distributed Computing, pp. 198–212. Institution of Electrical and Electronic Engineers Press, New York (2002) Ghanem, M., Guo, Y., Rowe, A., Wendel, P.: Grid-based knowledge discovery services for high throughput informatics. In: Proc. 11th Int. Symp. on High Performance Distributed Computing, pp. 198–212. Institution of Electrical and Electronic Engineers Press, New York (2002)
17.
go back to reference Hey, T., Tansley, S.: In: Tolle, K. (ed.) The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, Redmond (2009) Hey, T., Tansley, S.: In: Tolle, K. (ed.) The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, Redmond (2009)
18.
go back to reference ISO/IEC. Information processing systems—open systems interconnection—LOTOS—a formal description technique based on the temporal ordering of observational behaviour. ISO/IEC 8807. International Organization for Standardization, Geneva (1989) ISO/IEC. Information processing systems—open systems interconnection—LOTOS—a formal description technique based on the temporal ordering of observational behaviour. ISO/IEC 8807. International Organization for Standardization, Geneva (1989)
19.
go back to reference Kaveh, N., Emmerich, W.: Validating distributed object and component designs. In: Bernardo, M., Inverardi, P. (eds.) Formal Methods for Software Architecture. Lecture Notes in Computer Science, vol. 2804, pp. 63–91. Springer, Berlin (2003)CrossRef Kaveh, N., Emmerich, W.: Validating distributed object and component designs. In: Bernardo, M., Inverardi, P. (eds.) Formal Methods for Software Architecture. Lecture Notes in Computer Science, vol. 2804, pp. 63–91. Springer, Berlin (2003)CrossRef
20.
go back to reference Lambert, P.S., Bihagen, E.: Stratification research and occupation-based classifications. In: Lambert, P.S., Connelly, R., Blackburn, R.M., Gayle, V. (eds.) Social Stratification: Trends and Processes, chap. 2, pp. 13–28. Ashgate, Aldershot (2012) Lambert, P.S., Bihagen, E.: Stratification research and occupation-based classifications. In: Lambert, P.S., Connelly, R., Blackburn, R.M., Gayle, V. (eds.) Social Stratification: Trends and Processes, chap. 2, pp. 13–28. Ashgate, Aldershot (2012)
21.
go back to reference Li, J., Zhu, H., He, J.: Specifying and verifying web transactions. In: Suzuki, K., Higashino, T., Yasumoto, K., El-Fakih, K. (eds.) Proc. Formal Techniques for Networked and Distributed Systems (FORTE 2008), Lecture Notes in Computer Science, vol. 5048, pp. 168–183. Springer, Berlin (2008) Li, J., Zhu, H., He, J.: Specifying and verifying web transactions. In: Suzuki, K., Higashino, T., Yasumoto, K., El-Fakih, K. (eds.) Proc. Formal Techniques for Networked and Distributed Systems (FORTE 2008), Lecture Notes in Computer Science, vol. 5048, pp. 168–183. Springer, Berlin (2008)
22.
go back to reference Long, J.S.: The Workflow of Data Analysis Using Stata. CRC Press, Boca Raton (2009) Long, J.S.: The Workflow of Data Analysis Using Stata. CRC Press, Boca Raton (2009)
23.
go back to reference Mackenbach, J.P., Stirbu, L., Roskam, A.J., Schaap, M.M., Menvielle, G., Leinsalu, M., Kunst, A.E.: Socioeconomic inequalities in health in 22 European countries. N. Engl. J. Med. 358(23), 2468–2481 (2008)CrossRef Mackenbach, J.P., Stirbu, L., Roskam, A.J., Schaap, M.M., Menvielle, G., Leinsalu, M., Kunst, A.E.: Socioeconomic inequalities in health in 22 European countries. N. Engl. J. Med. 358(23), 2468–2481 (2008)CrossRef
24.
go back to reference Margaria, T., Kubczak, C., Steffen, B.: Bio-jETI: a service integration, design and provisioning platform for orchestrated bioinformatics processes. BMC Bioinf. 9(4), 1614–1631 (2008) Margaria, T., Kubczak, C., Steffen, B.: Bio-jETI: a service integration, design and provisioning platform for orchestrated bioinformatics processes. BMC Bioinf. 9(4), 1614–1631 (2008)
25.
go back to reference McVie, S., Coxon, A.P.M., Hawkins, P., Palmer, J., Rice, R.: ESRC/SFC scoping study into quantitative methods capacity building in Scotland. Economic and Social Research Council (2008) McVie, S., Coxon, A.P.M., Hawkins, P., Palmer, J., Rice, R.: ESRC/SFC scoping study into quantitative methods capacity building in Scotland. Economic and Social Research Council (2008)
26.
go back to reference Oinn, T., Addis, M., Ferris, J., Marvin, D., Senger, M., Greenwood, M., Carver, T., Glover, K., Pocock, M.R., Wipat, A., Li, P.: Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20(17), 3045–3054 (2004)CrossRef Oinn, T., Addis, M., Ferris, J., Marvin, D., Senger, M., Greenwood, M., Carver, T., Glover, K., Pocock, M.R., Wipat, A., Li, P.: Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20(17), 3045–3054 (2004)CrossRef
27.
go back to reference Pautasso, C.: JOpera: an agile environment for web service composition with visual unit testing and refactoring. In: Proc. IEEE Symposium on Visual Languages and Human Centric Computing. Institution of Electrical and Electronic Engineers Press, New York (2005) Pautasso, C.: JOpera: an agile environment for web service composition with visual unit testing and refactoring. In: Proc. IEEE Symposium on Visual Languages and Human Centric Computing. Institution of Electrical and Electronic Engineers Press, New York (2005)
28.
go back to reference Qin, J., Fahringer, T. (eds.): Scientific Workflows—Programming, Optimization, and Synthesis with ASKALON and AWDL. Springer, Berlin (2012) Qin, J., Fahringer, T. (eds.): Scientific Workflows—Programming, Optimization, and Synthesis with ASKALON and AWDL. Springer, Berlin (2012)
29.
go back to reference Smith, S.N., Fisher, S.D., Heath, A.: Opportunities and challenges in the expansion of cross-national survey research. Soc. Res. Methodol. 14(6), 485–502 (2011)CrossRef Smith, S.N., Fisher, S.D., Heath, A.: Opportunities and challenges in the expansion of cross-national survey research. Soc. Res. Methodol. 14(6), 485–502 (2011)CrossRef
30.
go back to reference Steffen, B., Margaria, T., Nagel, R., Jörges, S., Kubczak, C.: Model-driven development with the jABC. In: Bin, E., Ziv, A., Ur, S. (eds.) Hardware and Software. Verification and Testing, Lecture Notes in Computer Science, vol. 4383, pp. 92–108. Springer, Berlin (2007)CrossRef Steffen, B., Margaria, T., Nagel, R., Jörges, S., Kubczak, C.: Model-driven development with the jABC. In: Bin, E., Ziv, A., Ur, S. (eds.) Hardware and Software. Verification and Testing, Lecture Notes in Computer Science, vol. 4383, pp. 92–108. Springer, Berlin (2007)CrossRef
31.
go back to reference Tannenbaum, T., Wright, D., Miller, K., Livny, M.: Condor: a distributed job scheduler. In: Gropp, W., Lusk, E., Sterling, T. (eds.) Beowulf Cluster Computing with Linux, pp. 307–350. MIT Press, Boston (2003) Tannenbaum, T., Wright, D., Miller, K., Livny, M.: Condor: a distributed job scheduler. In: Gropp, W., Lusk, E., Sterling, T. (eds.) Beowulf Cluster Computing with Linux, pp. 307–350. MIT Press, Boston (2003)
32.
go back to reference Taylor, I.J., Deelman, E., Gannon, D.B., Shields, M. (eds.): Workflows for E-Science: Scientific Workflows for Grids. Springer, Berlin (2007) Taylor, I.J., Deelman, E., Gannon, D.B., Shields, M. (eds.): Workflows for E-Science: Scientific Workflows for Grids. Springer, Berlin (2007)
33.
go back to reference Treiman, D.J.: Quantitative Data Analysis: Doing Social Research to test Ideas. Jossey Bass, New York (2009) Treiman, D.J.: Quantitative Data Analysis: Doing Social Research to test Ideas. Jossey Bass, New York (2009)
34.
go back to reference Turner, K.J.: Analysing interactive voice services. Comput. Netw. 45(5), 665–685 (2004)CrossRef Turner, K.J.: Analysing interactive voice services. Comput. Netw. 45(5), 665–685 (2004)CrossRef
35.
go back to reference Turner, K.J.: Validating feature-based specifications. Softw. Pract. Exp. 36(10), 999–1027 (2006)CrossRef Turner, K.J.: Validating feature-based specifications. Softw. Pract. Exp. 36(10), 999–1027 (2006)CrossRef
36.
go back to reference Turner, K.J.: Flexible management of smart homes. Ambient Intell. Smart Environ. 3(2), 83–110 (2011) Turner, K.J.: Flexible management of smart homes. Ambient Intell. Smart Environ. 3(2), 83–110 (2011)
37.
go back to reference Turner, K.J., Tan, K.L.L.: Rigorous development of composite grid services. Netw. Comput. Appl. 35(4), 1304–1316 (2012)CrossRef Turner, K.J., Tan, K.L.L.: Rigorous development of composite grid services. Netw. Comput. Appl. 35(4), 1304–1316 (2012)CrossRef
38.
go back to reference Wassermann, B., Emmerich, W., Butchart, B., Cameron, N., Chen, L., Patel, J.: Sedna: a BPEL-based environment for visual scientific workflow modelling. In: Taylor, I.J., Deelman, E., Gannon, D.B., Shields, M. (eds.) Workflows for E-Science, pp. 428–449. Springer, Berlin (2007)CrossRef Wassermann, B., Emmerich, W., Butchart, B., Cameron, N., Chen, L., Patel, J.: Sedna: a BPEL-based environment for visual scientific workflow modelling. In: Taylor, I.J., Deelman, E., Gannon, D.B., Shields, M. (eds.) Workflows for E-Science, pp. 428–449. Springer, Berlin (2007)CrossRef
39.
go back to reference Wirsing, M., Clark, A., Gilmore, S., Hölzl, M., Knapp, A., Koch, N., Schröder, A.: Sensoria process calculi for service-oriented computing. In: Najm, E., Pradat-Peyre, J.-F. (eds.) Proc. Formal Techniques for Networked and Distributed Systems (FORTE 2006), Lecture Notes in Computer Science, vol. 4229, pp. 24–45. Springer, Berlin (2006) Wirsing, M., Clark, A., Gilmore, S., Hölzl, M., Knapp, A., Koch, N., Schröder, A.: Sensoria process calculi for service-oriented computing. In: Najm, E., Pradat-Peyre, J.-F. (eds.) Proc. Formal Techniques for Networked and Distributed Systems (FORTE 2006), Lecture Notes in Computer Science, vol. 4229, pp. 24–45. Springer, Berlin (2006)
40.
go back to reference Yu, J., Han, J., Falcarin, P., Morisio, M.: Using temporal business rules to synthesize service composition process models. In: van Sinderen, M. (ed.) Proc. 1st Int. Workshop on Architectures, Concepts and Technologies for Service Oriented Computing, pp. 86–95. INSTICC Press, Setúbal (2007) Yu, J., Han, J., Falcarin, P., Morisio, M.: Using temporal business rules to synthesize service composition process models. In: van Sinderen, M. (ed.) Proc. 1st Int. Workshop on Architectures, Concepts and Technologies for Service Oriented Computing, pp. 86–95. INSTICC Press, Setúbal (2007)
Metadata
Title
Workflows for quantitative data analysis in the social sciences
Authors
Kenneth J. Turner
Paul S. Lambert
Publication date
01-06-2015
Publisher
Springer Berlin Heidelberg
Published in
International Journal on Software Tools for Technology Transfer / Issue 3/2015
Print ISSN: 1433-2779
Electronic ISSN: 1433-2787
DOI
https://doi.org/10.1007/s10009-014-0315-4

Other articles of this Issue 3/2015

International Journal on Software Tools for Technology Transfer 3/2015 Go to the issue

Premium Partner