
2020 | Original Paper | Book Chapter

6. Introduction to Privacy-Preserving Data Collection and Sharing Methods for Global Health Research

Authors: Guanhong Miao, Hanzhi Gao, Yan Wang, Samuel S. Wu

Published in: Statistical Methods for Global Health and Epidemiology

Publisher: Springer International Publishing


Abstract

In global health and epidemiological research, collecting and sharing data on sensitive topics, such as income, age, sex partners, drug use, HIV infection, stigma, and religion, has been a long-standing challenge. In this chapter, we introduce a range of methods for privacy-preserving data collection and sharing. After a comprehensive review of the classic randomized response techniques and their extensions, we present a new privacy-preserving data collection method based on matrix masking theory. In addition to introducing the theory and principles, we use examples to illustrate how to apply the method in practice.

Metadata
Title
Introduction to Privacy-Preserving Data Collection and Sharing Methods for Global Health Research
Authors
Guanhong Miao
Hanzhi Gao
Yan Wang
Samuel S. Wu
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-35260-8_6
