2019 | OriginalPaper | Book chapter

Can We Assess Mental Health Through Social Media and Smart Devices? Addressing Bias in Methodology and Evaluation

Written by: Adam Tsakalidis, Maria Liakata, Theo Damoulas, Alexandra I. Cristea

Published in: Machine Learning and Knowledge Discovery in Databases

Publisher: Springer International Publishing

Abstract

Predicting mental health from smartphone and social media data on a longitudinal basis has recently attracted great interest, with very promising results reported across many studies [3, 9, 13, 26]. Such approaches have the potential to revolutionise mental health assessment, provided that their development and evaluation follow a real-world deployment setting. In this work we take a closer look at state-of-the-art approaches, using different mental health datasets and indicators, different feature sources and multiple simulations, in order to assess their ability to generalise. We demonstrate that, under a pragmatic evaluation framework, none of the approaches delivers, or even comes close to, the reported performance. In fact, we show that current state-of-the-art approaches can barely outperform the most naïve baselines in the real-world setting. This raises serious questions not only about their suitability for deployment, but also about the contribution of the derived features to the mental health assessment task and about how to make better use of such data in the future.

Footnotes
1
For [formula image IEq43], this creates the [formula image IEq44] cross-correlation issue in the MIXED/LOIOCV settings. For this reason, we ran the experiments by considering only the last entered score in a given day as our target. We did not witness any major differences that would alter our conclusions.
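The target selection described in this footnote (keeping only the last score entered on a given day) can be illustrated with a minimal sketch. The input layout, a list of `(timestamp, score)` pairs for one user, and the function name `last_score_per_day` are assumptions made for illustration; they are not taken from the chapter.

```python
from datetime import datetime


def last_score_per_day(entries):
    """Keep only the most recent score for each calendar day.

    `entries` is an iterable of (datetime, score) pairs for a single user.
    This layout is an assumption made for illustration.
    """
    latest = {}
    for timestamp, score in entries:
        day = timestamp.date()
        # Overwrite the stored entry when this one was entered later that day.
        if day not in latest or timestamp >= latest[day][0]:
            latest[day] = (timestamp, score)
    # Return a day -> score mapping ordered by date.
    return {day: pair[1] for day, pair in sorted(latest.items())}


entries = [
    (datetime(2018, 3, 1, 9, 0), 2),
    (datetime(2018, 3, 1, 21, 30), 4),  # later entry on the same day wins
    (datetime(2018, 3, 2, 8, 15), 3),
]
targets = last_score_per_day(entries)
```

With the sample entries above, the first day keeps the evening score and the second day keeps its single score.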
 
2
Accuracy is defined in [13] as follows: 5 classes are assumed (e.g., [0, ..., 4]) and the squared error e between the centre of a class and the point halfway towards the next class is calculated (e.g., 0.25). If the squared error of a test instance is smaller than e, then the instance is considered to have been classified correctly.
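As a minimal sketch of this accuracy definition, assuming equally spaced classes one unit apart (so that e = 0.5² = 0.25): the function name `tolerance_accuracy` and the `class_width` parameter are hypothetical and only serve the illustration.

```python
def tolerance_accuracy(y_true, y_pred, class_width=1.0):
    """Fraction of predictions whose squared error is strictly below e.

    e is the squared distance between a class centre and the point halfway
    to the next class; for unit-spaced classes this is 0.5 ** 2 = 0.25.
    The unit spacing is an assumption made for illustration.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equally long")
    e = (class_width / 2.0) ** 2
    # A prediction counts as correct only if its squared error is smaller than e.
    hits = [(t - p) ** 2 < e for t, p in zip(y_true, y_pred)]
    return sum(hits) / len(hits)


y_true = [0, 1, 2, 3, 4]
y_pred = [0.3, 1.6, 2.0, 2.51, 4.49]
accuracy = tolerance_accuracy(y_true, y_pred)
```

In this example only the second prediction misses (squared error 0.36), so four of the five instances count as correct. A prediction that lies exactly halfway between two classes has a squared error equal to e and is therefore counted as incorrect.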
 
3
In cases where the lowest of the top-30% scores (s) was equal to the highest of the bottom-30% scores, we excluded the instances with score s.
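The exclusion rule can be sketched as follows. How the top and bottom 30% were cut in the chapter is not stated here, so the sketch assumes a simple rank-based cut on sorted scores; the function name `split_extremes` and the integer `percent` parameter are hypothetical.

```python
def split_extremes(scores, percent=30):
    """Return (bottom, top) index lists for the lowest and highest `percent`%.

    If the lowest score in the top group equals the highest score in the
    bottom group (score s), every instance with score s is dropped from
    both groups. The rank-based cut is an assumption made for illustration.
    """
    n = len(scores)
    k = n * percent // 100
    if k == 0:
        return [], []
    # Indices ordered by ascending score (ties keep their original order).
    order = sorted(range(n), key=lambda i: scores[i])
    bottom, top = order[:k], order[n - k:]
    highest_of_bottom = scores[bottom[-1]]
    lowest_of_top = scores[top[0]]
    if highest_of_bottom == lowest_of_top:
        s = lowest_of_top
        bottom = [i for i in bottom if scores[i] != s]
        top = [i for i in top if scores[i] != s]
    return bottom, top


tied_scores = [1, 2, 3, 3, 3, 3, 3, 3, 4, 5]
bottom, top = split_extremes(tied_scores)
```

For the tied example, the score 3 sits on both sides of the cut, so the instances with score 3 are removed and the groups shrink to two instances each.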
 
References
1.
Bogomolov, A., Lepri, B., Ferron, M., Pianesi, F., Pentland, A.S.: Pervasive stress recognition for sustainable living. In: 2014 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops), pp. 345–350. IEEE (2014)
2.
Bogomolov, A., Lepri, B., Pianesi, F.: Happiness recognition from mobile phone data. In: 2013 International Conference on Social Computing (SocialCom), pp. 790–795. IEEE (2013)
3.
Canzian, L., Musolesi, M.: Trajectories of depression: unobtrusive monitoring of depressive states by means of smartphone mobility traces analysis. In: Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, pp. 1293–1304. ACM (2015)
4.
DeMasi, O., Kording, K., Recht, B.: Meaningless comparisons lead to false optimism in medical machine learning. PLoS One 12(9), e0184604 (2017)
5.
Farhan, A.A., et al.: Behavior vs. Introspection: refining prediction of clinical depression via smartphone sensing data. In: Wireless Health, pp. 30–37 (2016)
6.
Gimpel, K., et al.: Part-of-speech tagging for Twitter: annotation, features, and experiments. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers, vol. 2, pp. 42–47. Association for Computational Linguistics (2011)
7.
Herrman, H., Saxena, S., Moodie, R., et al.: Promoting mental health: concepts, emerging evidence, practice: a report of the World Health Organization, Department of Mental Health and Substance Abuse in Collaboration with the Victorian Health Promotion Foundation and the University of Melbourne. World Health Organization (2005)
8.
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177. ACM (2004)
9.
Jaques, N., Taylor, S., Azaria, A., Ghandeharioun, A., Sano, A., Picard, R.: Predicting students’ happiness from physiology, phone, mobility, and behavioral data. In: 2015 International Conference on Affective Computing and Intelligent Interaction (ACII), pp. 222–228. IEEE (2015)
10.
Jaques, N., Taylor, S., Sano, A., Picard, R.: Multi-task, multi-kernel learning for estimating individual wellbeing. In: Proceedings NIPS Workshop on Multimodal Machine Learning, Montreal, Quebec (2015)
11.
Kiritchenko, S., Zhu, X., Mohammad, S.M.: Sentiment analysis of short informal texts. J. Artif. Intell. Res. 50, 723–762 (2014)
12.
Kroenke, K., Strine, T.W., Spitzer, R.L., Williams, J.B., Berry, J.T., Mokdad, A.H.: The PHQ-8 as a measure of current depression in the general population. J. Affect. Disord. 114(1), 163–173 (2009)
13.
LiKamWa, R., Liu, Y., Lane, N.D., Zhong, L.: MoodScope: building a mood sensor from smartphone usage patterns. In: Proceeding of the 11th Annual International Conference on Mobile Systems, Applications, and Services, pp. 389–402. ACM (2013)
14.
Ma, Y., Xu, B., Bai, Y., Sun, G., Zhu, R.: Daily mood assessment based on mobile phone sensing. In: 2012 9th International Conference on Wearable and Implantable Body Sensor Networks (BSN), pp. 142–147. IEEE (2012)
15.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
16.
Mohammad, S.: #Emotional Tweets. In: *SEM 2012: The 1st Joint Conference on Lexical and Computational Semantics - Proceedings of the Main Conference and the Shared Task, and Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012), vols. 1 and 2, pp. 246–255. Association for Computational Linguistics (2012)
17.
Mohammad, S., Dunne, C., Dorr, B.: Generating high-coverage semantic orientation lexicons from overtly marked words and a thesaurus. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol. 2, pp. 599–608. Association for Computational Linguistics (2009)
18.
Nielsen, F.Å.: A new ANEW: evaluation of a word list for sentiment analysis in microblogs. In: Workshop on ‘Making Sense of Microposts’: Big Things Come in Small Packages, pp. 93–98 (2011)
20.
Olesen, J., Gustavsson, A., Svensson, M., Wittchen, H.U., Jönsson, B.: The economic cost of brain disorders in Europe. Eur. J. Neurol. 19(1), 155–162 (2012)
21.
Preoţiuc-Pietro, D., Volkova, S., Lampos, V., Bachrach, Y., Aletras, N.: Studying user income through language, behaviour and affect in social media. PLoS One 10(9), e0138717 (2015)
22.
Servia-Rodríguez, S., Rachuri, K.K., Mascolo, C., Rentfrow, P.J., Lathia, N., Sandstrom, G.M.: Mobile sensing at the service of mental well-being: a large-scale longitudinal study. In: Proceedings of the 26th International Conference on World Wide Web, pp. 103–112. International World Wide Web Conferences Steering Committee (2017)
23.
Suhara, Y., Xu, Y., Pentland, A.: DeepMood: forecasting depressed mood based on self-reported histories via recurrent neural networks. In: Proceedings of the 26th International Conference on World Wide Web, pp. 715–724. International World Wide Web Conferences Steering Committee (2017)
24.
Tang, D., Wei, F., Yang, N., Zhou, M., Liu, T., Qin, B.: Learning sentiment-specific word embedding for Twitter sentiment classification. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), vol. 1, pp. 1555–1565 (2014)
25.
Tennant, R., et al.: The Warwick-Edinburgh mental well-being scale (WEMWBS): development and UK validation. Health Qual. Life Outcomes 5(1), 63 (2007)
26.
Tsakalidis, A., Liakata, M., Damoulas, T., Jellinek, B., Guo, W., Cristea, A.I.: Combining heterogeneous user generated data to sense well-being. In: Proceedings of the 26th International Conference on Computational Linguistics, pp. 3007–3018 (2016)
27.
Wang, R., et al.: CrossCheck: toward passive sensing and detection of mental health changes in people with schizophrenia. In: Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, pp. 886–897. ACM (2016)
28.
Wang, R., et al.: StudentLife: assessing mental health, academic performance and behavioral trends of college students using smartphones. In: Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing, pp. 3–14. ACM (2014)
29.
Watson, D., Clark, L.A., Tellegen, A.: Development and validation of brief measures of positive and negative affect: the PANAS scales. J. Pers. Soc. Psychol. 54(6), 1063 (1988)
30.
Zhu, X., Kiritchenko, S., Mohammad, S.M.: NRC-Canada-2014: recent improvements in the sentiment analysis of Tweets. In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pp. 443–447. Citeseer (2014)
Metadata
Title
Can We Assess Mental Health Through Social Media and Smart Devices? Addressing Bias in Methodology and Evaluation
Written by
Adam Tsakalidis
Maria Liakata
Theo Damoulas
Alexandra I. Cristea
Copyright year
2019
DOI
https://doi.org/10.1007/978-3-030-10997-4_25