Skip to main content
Top

2025 | OriginalPaper | Chapter

Mild Cognitive Impairment Prediction Using Facial and Speech Data

Authors : Chien-Cheng Lee, Wei-Chieh Huang, Yi-Fang Chuang

Published in: Advances in Mobile Computing and Multimedia Intelligence

Publisher: Springer Nature Switzerland

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Mild cognitive impairment (MCI) represents a transitional stage between the cognitive decline associated with normal aging and more severe conditions such as dementia. Early diagnosis of MCI is crucial for effective healthcare intervention. However, current detection methods are often costly and time-consuming. This study introduces a multimodal fusion network (MFN) designed to predict MCI more efficiently. The proposed network utilizes dual-stream ResNets to process both facial and speech features. These features, extracted from the convolutional and subsampling layers of the ResNets, are subsequently fused in a fully connected layer to generate the final prediction. The dataset comprises a total of 52 participant videos, with an equal distribution: 26 videos from participants with normal cognitive function and 26 videos from participants diagnosed with MCI. Experimental results demonstrate the effectiveness of this approach, with an F1 score of 0.89 across test participants.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Tombaugh, T.N., McIntyre, N.J.: The mini-mental state examination: a comprehensive review. J. Am. Geriatr. Soc. 40, 922–935 (1992)CrossRef Tombaugh, T.N., McIntyre, N.J.: The mini-mental state examination: a comprehensive review. J. Am. Geriatr. Soc. 40, 922–935 (1992)CrossRef
2.
go back to reference Themistocleous, C., Eckerström, M., Kokkinakis, D.: Voice quality and speech fluency distinguish individuals with mild cognitive impairment from healthy controls. PLoS ONE 15, e0236009 (2020)CrossRef Themistocleous, C., Eckerström, M., Kokkinakis, D.: Voice quality and speech fluency distinguish individuals with mild cognitive impairment from healthy controls. PLoS ONE 15, e0236009 (2020)CrossRef
3.
go back to reference Yu, B., Williamson, J.R., Mundt, J.C., Quatieri, T.F.: Speech-based automated cognitive impairment detection from remotely-collected cognitive test audio. IEEE Access 6, 40494–40505 (2018)CrossRef Yu, B., Williamson, J.R., Mundt, J.C., Quatieri, T.F.: Speech-based automated cognitive impairment detection from remotely-collected cognitive test audio. IEEE Access 6, 40494–40505 (2018)CrossRef
4.
go back to reference Tanaka, H., Adachi, H., Kazui, H., Ikeda, M., Kudo, T., Nakamura, S.: Detecting dementia from face in human-agent interaction. In: Adjunct of the 2019 International Conference on Multimodal Interaction, pp. 1–6 (2019) Tanaka, H., Adachi, H., Kazui, H., Ikeda, M., Kudo, T., Nakamura, S.: Detecting dementia from face in human-agent interaction. In: Adjunct of the 2019 International Conference on Multimodal Interaction, pp. 1–6 (2019)
5.
go back to reference Wang, Y., Dantcheva, A., Broutart, J.C., Robert, P., Bremond, F., Bilinski, P.: Comparing methods for assessment of facial dynamics in patients with major neurocognitive disorders. In: Leal-Taixé, L., Roth, S. (eds.) Computer Vision – ECCV 2018 Workshops. ECCV 2018. LNCS, vol. 11134, pp. 144–157. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-11024-6_10 Wang, Y., Dantcheva, A., Broutart, J.C., Robert, P., Bremond, F., Bilinski, P.: Comparing methods for assessment of facial dynamics in patients with major neurocognitive disorders. In: Leal-Taixé, L., Roth, S. (eds.) Computer Vision – ECCV 2018 Workshops. ECCV 2018. LNCS, vol. 11134, pp. 144–157. Springer, Cham (2019). https://​doi.​org/​10.​1007/​978-3-030-11024-6_​10
6.
7.
go back to reference Kong, Q., Cao, Y., Iqbal, T., Wang, Y., Wang, W., Plumbley, M.D.: PANNs: large-scale pretrained audio neural networks for audio pattern recognition. IEEE/ACM Trans. Audio Speech Lang. Process. 28, 2880–2894 (2020)CrossRef Kong, Q., Cao, Y., Iqbal, T., Wang, Y., Wang, W., Plumbley, M.D.: PANNs: large-scale pretrained audio neural networks for audio pattern recognition. IEEE/ACM Trans. Audio Speech Lang. Process. 28, 2880–2894 (2020)CrossRef
8.
go back to reference Park, T.J., Koluguri, N.R., Balam, J., Ginsburg, B.: Multi-scale speaker diarization with dynamic scale weighting. arXiv preprint arXiv:2203.15974 (2022) Park, T.J., Koluguri, N.R., Balam, J., Ginsburg, B.: Multi-scale speaker diarization with dynamic scale weighting. arXiv preprint arXiv:​2203.​15974 (2022)
10.
go back to reference Baltrušaitis, T., Robinson, P., Morency, L.-P.: Openface: an open source facial behavior analysis toolkit. In: 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1–10. IEEE (2016) Baltrušaitis, T., Robinson, P., Morency, L.-P.: Openface: an open source facial behavior analysis toolkit. In: 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1–10. IEEE (2016)
Metadata
Title
Mild Cognitive Impairment Prediction Using Facial and Speech Data
Authors
Chien-Cheng Lee
Wei-Chieh Huang
Yi-Fang Chuang
Copyright Year
2025
DOI
https://doi.org/10.1007/978-3-031-78049-3_9

Premium Partner