Published in: Quality & Quantity 4/2022

5 October 2021

Ensuring survey research data integrity in the era of internet bots

Authors: Marybec Griffin, Richard J. Martino, Caleb LoSchiavo, Camilla Comer-Carruthers, Kristen D. Krause, Christopher B. Stults, Perry N. Halkitis

Abstract

We used an internet-based survey platform to conduct a cross-sectional survey on the impact of COVID-19 on the LGBTQ+ population in the United States. Although this method of data collection was quick and inexpensive, the data required extensive cleaning because of bot infiltration. Based on this experience, we provide recommendations for ensuring data integrity. Recruitment conducted on May 7–8, 2020 resulted in an initial sample of 1251 responses. The Qualtrics survey was disseminated via social media and professional association listservs. After noticing data discrepancies, research staff developed a rigorous five-step data cleaning protocol. A second wave of recruitment was conducted on June 11–12, 2020 using the original recruitment methods. The protocol led to the removal of 773 (61.8%) surveys from the initial dataset, leaving 478 participants in the first wave of data collection. It led to the removal of 46 (31.9%) surveys from the second two-day wave, leaving 98 participants in the second wave. After verifying that the two-day pilot process was effective at screening for bots, we reopened the survey for a third wave of data collection, which yielded 709 responses: 514 (72.5%) were identified as valid participants and 194 (27.4%) were removed as possible bots. The final analytic sample consists of 1090 participants. Although internet-based research is a useful and efficient tool, especially for hard-to-reach populations, it is vulnerable to bots and mischievous responders despite survey platforms’ built-in protections. Beyond depleting research funds, bot infiltration threatens data integrity and may disproportionately harm research with marginalized populations.
Based on our experience, we recommend strategies such as qualitative questions, duplicate demographic questions, and incentive raffles to reduce the likelihood of mischievous respondents. These protections can help ensure data integrity and facilitate research on vulnerable populations.
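
The sample counts reported in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only: the dictionary layout and the helper name `share` are assumptions made for this example, not part of the authors' protocol, and the numbers are copied from the abstract. The second-wave total is not stated, so it is derived here as valid plus removed. Note that the third-wave counts as reported (514 valid, 194 removed) sum to 708 rather than 709; the abstract does not say how the remaining response was classified.

```python
# Counts copied from the abstract; the data layout is a hypothetical
# convenience for this check, not the authors' own data structure.
waves = {
    "wave 1": {"valid": 478, "removed": 773, "total": 1251},
    # Total for wave 2 is not reported; derived as valid + removed.
    "wave 2": {"valid": 98, "removed": 46, "total": 98 + 46},
    "wave 3": {"valid": 514, "removed": 194, "total": 709},
}


def share(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to one decimal."""
    return round(100 * count / total, 1)


# Final analytic sample: valid participants summed over all three waves.
final_sample = sum(w["valid"] for w in waves.values())

for name, w in waves.items():
    unaccounted = w["total"] - (w["valid"] + w["removed"])
    print(
        f"{name}: removed {share(w['removed'], w['total'])}%, "
        f"valid {share(w['valid'], w['total'])}%, "
        f"responses not accounted for: {unaccounted}"
    )

print("final analytic sample:", final_sample)
```

Computing each percentage against the stated wave total, rather than against valid plus removed, reproduces the reported third-wave figures of 72.5% and 27.4%.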

Metadata
Title
Ensuring survey research data integrity in the era of internet bots
Authors
Marybec Griffin
Richard J. Martino
Caleb LoSchiavo
Camilla Comer-Carruthers
Kristen D. Krause
Christopher B. Stults
Perry N. Halkitis
Publication date
5 October 2021
Publisher
Springer Netherlands
Published in
Quality & Quantity / Issue 4/2022
Print ISSN: 0033-5177
Electronic ISSN: 1573-7845
DOI
https://doi.org/10.1007/s11135-021-01252-1
