Skip to main content
Top
Published in: Neural Computing and Applications 14/2024

17-02-2024 | Original Article

Masked self-supervised ECG representation learning via multiview information bottleneck

Authors: Shunxiang Yang, Cheng Lian, Zhigang Zeng, Bingrong Xu, Yixin Su, Chenyang Xue

Published in: Neural Computing and Applications | Issue 14/2024

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In recent years, self-supervised learning-based models have been widely used for electrocardiogram (ECG) representation learning. However, most of the models utilize contrastive learning that strongly depend on data augmentation. In this paper, we propose a masked self-supervised learning model based on multiview information bottleneck principle. Our method masks the ECG signal instances in the time and frequency domains at a high ratio and then uses the autoencoder to reconstruct the original input. Not only the intra-view relations within each view but also the inter-view relations between two views are exploited in ECG representation learning. Furthermore, we use the multiview information bottleneck principle to remove redundant information in the time and frequency domains, so that the representations of both views contain more task-relevant information. Our model is pre-trained on three larger ECG datasets at once and fine-tuned on each classification task. Experimental results show that our model not only outperforms state-of-the-art models with self-supervised learning, but also outperforms models with supervised learning.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Biel L, Pettersson O, Philipson L, Wide P (2001) Ecg analysis: a new approach in human identification. IEEE Trans Instrum Meas 50(3):808–812CrossRef Biel L, Pettersson O, Philipson L, Wide P (2001) Ecg analysis: a new approach in human identification. IEEE Trans Instrum Meas 50(3):808–812CrossRef
2.
go back to reference Poungponsri S, Yu X (2013) An adaptive filtering approach for electrocardiogram (ecg) signal noise reduction using neural networks. Neurocomputing 117:206–213CrossRef Poungponsri S, Yu X (2013) An adaptive filtering approach for electrocardiogram (ecg) signal noise reduction using neural networks. Neurocomputing 117:206–213CrossRef
3.
go back to reference Yıldırım Ö, Pławiak P, Tan R, Acharya UR (2018) Arrhythmia detection using deep convolutional neural network with long duration ecg signals. Comput Biol Med 102:411–420CrossRef Yıldırım Ö, Pławiak P, Tan R, Acharya UR (2018) Arrhythmia detection using deep convolutional neural network with long duration ecg signals. Comput Biol Med 102:411–420CrossRef
4.
go back to reference Petmezas G, Haris K, Stefanopoulos L, Kilintzis V, Tzavelis A, Rogers JA, Katsaggelos AK, Maglaveras N (2021) Automated atrial fibrillation detection using a hybrid cnn-lstm network on imbalanced ecg datasets. Biomed Signal Process Control 63:102194CrossRef Petmezas G, Haris K, Stefanopoulos L, Kilintzis V, Tzavelis A, Rogers JA, Katsaggelos AK, Maglaveras N (2021) Automated atrial fibrillation detection using a hybrid cnn-lstm network on imbalanced ecg datasets. Biomed Signal Process Control 63:102194CrossRef
5.
go back to reference Jiang K, Liang S, Meng L, Zhang Y, Wang P, Wang W (2020) A two-level attention-based sequence-to-sequence model for accurate inter-patient arrhythmia detection. In: 2020 IEEE international conference on bioinformatics and biomedicine (BIBM). IEEE, pp 1029–1033 Jiang K, Liang S, Meng L, Zhang Y, Wang P, Wang W (2020) A two-level attention-based sequence-to-sequence model for accurate inter-patient arrhythmia detection. In: 2020 IEEE international conference on bioinformatics and biomedicine (BIBM). IEEE, pp 1029–1033
6.
go back to reference Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations. In: International conference on machine learning. PMLR, pp 1597–1607 Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations. In: International conference on machine learning. PMLR, pp 1597–1607
8.
go back to reference Grill JB, Strub F, Altché F, Tallec C, Richemond P, Buchatskaya E, Doersch C, Avila Pires B, Guo Z, Gheshlaghi Azar M et al (2020) Bootstrap your own latent—a new approach to self-supervised learning. Adv Neural Inf Process Syst 33:21271–21284 Grill JB, Strub F, Altché F, Tallec C, Richemond P, Buchatskaya E, Doersch C, Avila Pires B, Guo Z, Gheshlaghi Azar M et al (2020) Bootstrap your own latent—a new approach to self-supervised learning. Adv Neural Inf Process Syst 33:21271–21284
9.
go back to reference Devlin J, Chang M, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 Devlin J, Chang M, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:​1810.​04805
10.
11.
go back to reference Gopal B, Han R, Raghupathi G, Ng A, Tison G, Rajpurkar P (2021) 3kg: Contrastive learning of 12-lead electrocardiograms using physiologically-inspired augmentations. In: Machine learning for health. PMLR, pp 156–167 Gopal B, Han R, Raghupathi G, Ng A, Tison G, Rajpurkar P (2021) 3kg: Contrastive learning of 12-lead electrocardiograms using physiologically-inspired augmentations. In: Machine learning for health. PMLR, pp 156–167
12.
go back to reference Kallidromitis K, Gudovskiy D, Kazuki K, Iku O, Rigazio L (2021) Contrastive neural processes for self-supervised learning. In: Asian conference on machine learning. PMLR, pp 594–609 Kallidromitis K, Gudovskiy D, Kazuki K, Iku O, Rigazio L (2021) Contrastive neural processes for self-supervised learning. In: Asian conference on machine learning. PMLR, pp 594–609
13.
go back to reference Mehari T, Strodthoff N (2022) Self-supervised representation learning from 12-lead ecg data. Comput Biol Med 141:105114CrossRef Mehari T, Strodthoff N (2022) Self-supervised representation learning from 12-lead ecg data. Comput Biol Med 141:105114CrossRef
14.
go back to reference Raghu M, Unterthiner T, Kornblith S, Zhang C, Dosovitskiy A (2021) Do vision transformers see like convolutional neural networks? Adv Neural Inf Process Syst 34:12116–12128 Raghu M, Unterthiner T, Kornblith S, Zhang C, Dosovitskiy A (2021) Do vision transformers see like convolutional neural networks? Adv Neural Inf Process Syst 34:12116–12128
15.
go back to reference Le MD, Rathour VS, Truong QS, Mai Q, Brijesh P, Le N (2021) Multi-module recurrent convolutional neural network with transformer encoder for ecg arrhythmia classification. In: 2021 IEEE EMBS international conference on biomedical and health informatics (BHI). IEEE, pp 1–5 Le MD, Rathour VS, Truong QS, Mai Q, Brijesh P, Le N (2021) Multi-module recurrent convolutional neural network with transformer encoder for ecg arrhythmia classification. In: 2021 IEEE EMBS international conference on biomedical and health informatics (BHI). IEEE, pp 1–5
16.
go back to reference Haque AF, Ali MH, Kiber MA (2010) Improved spectrogram analysis for ecg signal in emergency medical applications. Int J Adv Comput Sci Appl 1(3):2010 Haque AF, Ali MH, Kiber MA (2010) Improved spectrogram analysis for ecg signal in emergency medical applications. Int J Adv Comput Sci Appl 1(3):2010
17.
go back to reference Hussein AF, Hashim SJ, Aziz AFA, Rokhani FZ, Adnan WAW (2018) Performance evaluation of time-frequency distributions for ecg signal analysis. J Med Syst 42:1–16CrossRef Hussein AF, Hashim SJ, Aziz AFA, Rokhani FZ, Adnan WAW (2018) Performance evaluation of time-frequency distributions for ecg signal analysis. J Med Syst 42:1–16CrossRef
18.
go back to reference Huang J, Chen B, Yao B, He W (2019) Ecg arrhythmia classification using stft-based spectrogram and convolutional neural network. IEEE Access 7:92871–92880CrossRef Huang J, Chen B, Yao B, He W (2019) Ecg arrhythmia classification using stft-based spectrogram and convolutional neural network. IEEE Access 7:92871–92880CrossRef
19.
go back to reference Chen T, Huang C, Shih ES, Hu Y, Hwang M (2020) Detection and classification of cardiac arrhythmias by a challenge-best deep learning neural network model. Iscience 23(3):100886CrossRef Chen T, Huang C, Shih ES, Hu Y, Hwang M (2020) Detection and classification of cardiac arrhythmias by a challenge-best deep learning neural network model. Iscience 23(3):100886CrossRef
20.
go back to reference Strodthoff N, Wagner P, Schaeffter T, Samek W (2020) Deep learning for ecg analysis: Benchmarks and insights from ptb-xl. IEEE J Biomed Health Inform 25(5):1519–1528CrossRef Strodthoff N, Wagner P, Schaeffter T, Samek W (2020) Deep learning for ecg analysis: Benchmarks and insights from ptb-xl. IEEE J Biomed Health Inform 25(5):1519–1528CrossRef
21.
go back to reference Wang Z, Yan W, Oates T (2017) Time series classification from scratch with deep neural networks: A strong baseline. In: 2017 international joint conference on neural networks (IJCNN). IEEE, pp 1578–1585 Wang Z, Yan W, Oates T (2017) Time series classification from scratch with deep neural networks: A strong baseline. In: 2017 international joint conference on neural networks (IJCNN). IEEE, pp 1578–1585
22.
go back to reference He T, Zhang Z, Zhang H, Zhang Z, Xie J, Li M (2019) Bag of tricks for image classification with convolutional neural networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 558–567 He T, Zhang Z, Zhang H, Zhang Z, Xie J, Li M (2019) Bag of tricks for image classification with convolutional neural networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 558–567
23.
go back to reference Ismail Fawaz H, Lucas B, Forestier G, Pelletier C, Schmidt DF, Weber J, Webb GI, Idoumghar L, Muller P-A, Petitjean F (2020) Inceptiontime: Finding alexnet for time series classification. Data Min Knowl Disc 34(6):1936–1962MathSciNetCrossRef Ismail Fawaz H, Lucas B, Forestier G, Pelletier C, Schmidt DF, Weber J, Webb GI, Idoumghar L, Muller P-A, Petitjean F (2020) Inceptiontime: Finding alexnet for time series classification. Data Min Knowl Disc 34(6):1936–1962MathSciNetCrossRef
24.
go back to reference Zhang S, Zheng D, Hu X, Yang M (2015) Bidirectional long short-term memory networks for relation classification. In: Proceedings of the 29th Pacific Asia conference on language, information and computation, pp 73–78 Zhang S, Zheng D, Hu X, Yang M (2015) Bidirectional long short-term memory networks for relation classification. In: Proceedings of the 29th Pacific Asia conference on language, information and computation, pp 73–78
25.
go back to reference Fan X, Yao Q, Cai Y, Miao F, Sun F, Li Y (2018) Multiscaled fusion of deep convolutional neural networks for screening atrial fibrillation from single lead short ecg recordings. IEEE J Biomed Health Inform 22(6):1744–1753CrossRef Fan X, Yao Q, Cai Y, Miao F, Sun F, Li Y (2018) Multiscaled fusion of deep convolutional neural networks for screening atrial fibrillation from single lead short ecg recordings. IEEE J Biomed Health Inform 22(6):1744–1753CrossRef
26.
go back to reference Hou B, Yang J, Wang P, Yan R (2019) Lstm-based auto-encoder model for ecg arrhythmias classification. IEEE Trans Instrum Meas 69(4):1232–1240CrossRef Hou B, Yang J, Wang P, Yan R (2019) Lstm-based auto-encoder model for ecg arrhythmias classification. IEEE Trans Instrum Meas 69(4):1232–1240CrossRef
27.
go back to reference Chen H, Wang G, Zhang G, Zhang P, Yang H (2021) Clecg: A novel contrastive learning framework for electrocardiogram arrhythmia classification. IEEE Signal Process Lett 28:1993–1997CrossRef Chen H, Wang G, Zhang G, Zhang P, Yang H (2021) Clecg: A novel contrastive learning framework for electrocardiogram arrhythmia classification. IEEE Signal Process Lett 28:1993–1997CrossRef
28.
go back to reference Kiyasseh D, Zhu T, Clifton DA (2021) Clocs: Contrastive learning of cardiac signals across space, time, and patients. In: International conference on machine learning. PMLR, pp 5606–5615 Kiyasseh D, Zhu T, Clifton DA (2021) Clocs: Contrastive learning of cardiac signals across space, time, and patients. In: International conference on machine learning. PMLR, pp 5606–5615
29.
go back to reference Oh J, Chung H, Kwon J, Hong D, Choi E (2022) Lead-agnostic self-supervised learning for local and global representations of electrocardiogram. In: Conference on health, inference, and learning. PMLR, pp 338–353 Oh J, Chung H, Kwon J, Hong D, Choi E (2022) Lead-agnostic self-supervised learning for local and global representations of electrocardiogram. In: Conference on health, inference, and learning. PMLR, pp 338–353
30.
go back to reference Baevski A, Zhou Y, Mohamed A, Auli M (2020) wav2vec 2.0: A framework for self-supervised learning of speech representations. Adv Neural Inf Process Syst 33:12449–12460 Baevski A, Zhou Y, Mohamed A, Auli M (2020) wav2vec 2.0: A framework for self-supervised learning of speech representations. Adv Neural Inf Process Syst 33:12449–12460
31.
go back to reference Chen X, Ding M, Wang X, Xin Y, Mo S, Wang Y, Han S, Luo P, Zeng G, Wang J (2022) Context autoencoder for self-supervised representation learning. arXiv preprint arXiv:2202.03026 Chen X, Ding M, Wang X, Xin Y, Mo S, Wang Y, Han S, Luo P, Zeng G, Wang J (2022) Context autoencoder for self-supervised representation learning. arXiv preprint arXiv:​2202.​03026
32.
go back to reference Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:​2010.​11929
33.
go back to reference He K, Chen X, Xie S, Li Y, Dollár P, Girshick R (2022) Masked autoencoders are scalable vision learners. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 16000–16009 He K, Chen X, Xie S, Li Y, Dollár P, Girshick R (2022) Masked autoencoders are scalable vision learners. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 16000–16009
34.
go back to reference Dou Z, Xu Y, Gan Z, Wang J, Wang S, Wang L, Zhu C, Zhang P, Yuan L, Peng N, et al. (2022) An empirical study of training end-to-end vision-and-language transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 18166–18176 Dou Z, Xu Y, Gan Z, Wang J, Wang S, Wang L, Zhu C, Zhang P, Yuan L, Peng N, et al. (2022) An empirical study of training end-to-end vision-and-language transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 18166–18176
36.
go back to reference Federici M, Dutta A, Forré P, Kushman N, Akata Z (2020) Learning robust representations via multi-view information bottleneck. arXiv preprint arXiv:2002.07017 Federici M, Dutta A, Forré P, Kushman N, Akata Z (2020) Learning robust representations via multi-view information bottleneck. arXiv preprint arXiv:​2002.​07017
37.
go back to reference Joyce JM (2011) Kullback–Leibler divergence. In: International encyclopedia of statistical science. Springer, pp 720–722 Joyce JM (2011) Kullback–Leibler divergence. In: International encyclopedia of statistical science. Springer, pp 720–722
38.
go back to reference Shapiro A (2003) Monte Carlo sampling methods. Handb Oper Res Manag Sci 10:353–425MathSciNet Shapiro A (2003) Monte Carlo sampling methods. Handb Oper Res Manag Sci 10:353–425MathSciNet
40.
go back to reference Wagner P, Strodthoff N, Bousseljot RD, Kreiseler D, Lunze FI, Samek W, Schaeffter T (2020) Ptb-xl, a large publicly available electrocardiography dataset. Sci Data 7(1):1–15CrossRef Wagner P, Strodthoff N, Bousseljot RD, Kreiseler D, Lunze FI, Samek W, Schaeffter T (2020) Ptb-xl, a large publicly available electrocardiography dataset. Sci Data 7(1):1–15CrossRef
41.
go back to reference Liu F, Liu C, Zhao L, Zhang X, Wu X, Xu X, Liu Y, Ma C, Wei S, He Z et al (2018) An open access database for evaluating the algorithms of electrocardiogram rhythm and morphology abnormality detection. J Med Imaging Health Inform 8(7):1368–1373CrossRef Liu F, Liu C, Zhao L, Zhang X, Wu X, Xu X, Liu Y, Ma C, Wei S, He Z et al (2018) An open access database for evaluating the algorithms of electrocardiogram rhythm and morphology abnormality detection. J Med Imaging Health Inform 8(7):1368–1373CrossRef
42.
go back to reference Zheng J, Zhang J, Danioko S, Yao H, Guo H, Rakovski C (2020) A 12-lead electrocardiogram database for arrhythmia research covering more than 10,000 patients. Sci Data 7(1):1–8CrossRef Zheng J, Zhang J, Danioko S, Yao H, Guo H, Rakovski C (2020) A 12-lead electrocardiogram database for arrhythmia research covering more than 10,000 patients. Sci Data 7(1):1–8CrossRef
43.
go back to reference Graves A (2012) Long short-term memory. Supervised sequence labelling with recurrent neural networks, pp 37–45 Graves A (2012) Long short-term memory. Supervised sequence labelling with recurrent neural networks, pp 37–45
44.
go back to reference He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
45.
go back to reference Szegedy C, Ioffe S, Vanhoucke V, Alemi AA (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. In: Thirty-first AAAI conference on artificial intelligence Szegedy C, Ioffe S, Vanhoucke V, Alemi AA (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. In: Thirty-first AAAI conference on artificial intelligence
Metadata
Title
Masked self-supervised ECG representation learning via multiview information bottleneck
Authors
Shunxiang Yang
Cheng Lian
Zhigang Zeng
Bingrong Xu
Yixin Su
Chenyang Xue
Publication date
17-02-2024
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 14/2024
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-024-09486-4

Other articles of this Issue 14/2024

Neural Computing and Applications 14/2024 Go to the issue

Premium Partner