skip to main content
10.1145/3097983.3097997acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Public Access

Patient Subtyping via Time-Aware LSTM Networks

Authors Info & Claims
Published:04 August 2017Publication History

ABSTRACT

In the study of various diseases, heterogeneity among patients usually leads to different progression patterns and may require different types of therapeutic intervention. Therefore, it is important to study patient subtyping, which is grouping of patients into disease characterizing subtypes. Subtyping from complex patient data is challenging because of the information heterogeneity and temporal dynamics. Long-Short Term Memory (LSTM) has been successfully used in many domains for processing sequential data, and recently applied for analyzing longitudinal patient records. The LSTM units are designed to handle data with constant elapsed times between consecutive elements of a sequence. Given that time lapse between successive elements in patient records can vary from days to months, the design of traditional LSTM may lead to suboptimal performance. In this paper, we propose a novel LSTM unit called Time-Aware LSTM (T-LSTM) to handle irregular time intervals in longitudinal patient records. We learn a subspace decomposition of the cell memory which enables time decay to discount the memory content according to the elapsed time. We propose a patient subtyping model that leverages the proposed T-LSTM in an auto-encoder to learn a powerful single representation for sequential records of patients, which are then used to cluster patients into clinical subtypes. Experiments on synthetic and real world datasets show that the proposed T-LSTM architecture captures the underlying structures in the sequences with time irregularities.

Skip Supplemental Material Section

Supplemental Material

baytas_patient_subtyping.mp4

mp4

394 MB

References

  1. Yoshua Bengio, Aaron Courville, and Pascal Vincent. 2014. Representation Learning: A Review and New Perspectives. arXiv:1206.5538v3[cs.LG] (2014). https://arxiv.org/abs/1206.5538Google ScholarGoogle Scholar
  2. Yoshua Bengio, Patrice Simard, and Paolo Frasconi. 1994. Learning Long-Term Dependencies with Gradient Descent is Difficult. IEEE Transactions on Neural Networks Vol. 5, 2 (March 1994), 157--166. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chao Che, Cao Xiao, Jian Liang, Bo Jin, Jiayu Zhou, and Fei Wang. 2017. An RNN Architecture with Dynamic Temporal Matching for Personalized Predictions of Parkinson's Disease. In Proceedings of the 2017 SIAM International Conference on Data Mining. Google ScholarGoogle ScholarCross RefCross Ref
  4. Zhengping Che, David Kale, Wenzhe Li, Mohammad Taha Bahadori, and Yan Liu 2015. Deep Computational Phenotyping. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 507--516. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Zhengping Che, Sanjay Purushotham, Kyunghyun Cho, David Sontag, and Yan Liu 2016. Recurrent Neural Networks for Multivariate Time Series with Missing Values. arXiv preprint arXiv:1606.01865 (2016).Google ScholarGoogle Scholar
  6. Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, and Dzmitry Bahdanau et al. 2014. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. arXiv:1406.1078v3[cs.CL] (2014). https://arxiv.org/pdf/1406.1078v3Google ScholarGoogle Scholar
  7. Edward Choi, Mohammad Taha Bahadori, Andy Schuetzy, Walter F. Stewarty, and Jimeng Sun. 2016. Doctor AI: Predicting Clinical Events via Recurrent Neural Networks. arXiv:1511.05942v11 [cs.LG] (2016). https://arxiv.org/pdf/1511.05942v11.pdfGoogle ScholarGoogle Scholar
  8. Edward Choi, Mohammad Taha Bahadori, Andy Schuetzy, Walter F. Stewarty, and Jimeng Sun. 2016natexlabb. RETAIN: Interpretable Predictive Model in Healthcare using Reverse Time Attention Mechanism. arXiv:1608.05745v3 [cs.LG] (2016). https://arxiv.org/pdf/1608.05745v3.pdfGoogle ScholarGoogle Scholar
  9. Edward Choi, Mohammad Taha Bahadori, Elizabeth Searles, Catherine Coffey, Michael Thompson, James Bost, Javier Tejedor-Sojo, and Jimeng Sun. 2016. Multi-layer Representation Learning for Medical Concepts Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining KDD16. Association for Computing Machinery (ACM). http://dx.doi.org/10.1145/2939672.2939823 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Ivo D. Dinov, Ben Heavner, Ming Tang, Gustavo Glusman, Kyle Chard, and Mike Darcy et al. 2016. Predictive Big Data Analytics: A Study of Parkinson's Disease Using Large, Complex, Heterogeneous, Incongruent, Multi-Source and Incomplete Observations. PLoS ONE, Vol. 11, (8):e0157077 (August 2016). Google ScholarGoogle ScholarCross RefCross Ref
  11. Jeff Donahue, Lisa Anne Hendricks, Marcus Rohrbach, Subhashini Venugopalan, Sergio Guadarrama, Kate Saenko, and Trevor Darrell. 2016. Long-term Recurrent Convolutional Networks for Visual Recognition and Description. arXiv:1411.4389v4[cs.CV] (2016). https://arxiv.org/pdf/1411.4389.pdfGoogle ScholarGoogle Scholar
  12. Cristobal Esteban, Oliver Staeck, Yinchong Yang, and Volker Tresp 2016. Predicting Clinical Events by Combining Static and Dynamic Information Using Recurrent Neural Networks. arXiv:1602.02685v1 [cs.LG] (2016). https://arxiv.org/pdf/1602.02685v1.pdfGoogle ScholarGoogle Scholar
  13. Seyed-Mohammad Fereshtehnejad, Silvia Ríos-Romenets, Julius B. M. Anang, and Ronald B. Postuma. 2015. New Clinical Subtypes of Parkinson Disease and Their Longitudinal Progression: A Prospective Cohort Comparison With Other Phenotypes. JAMA Neurol, Vol. 72, 8 (2015), 863--873. Google ScholarGoogle ScholarCross RefCross Ref
  14. Alex Graves, Abdel rahman Mohamed, and Geoffrey Hinton. 2013. Speech Recognition with Deep Recurrent Neural Networks. arXiv:1303.5778[cs.NE] (2013). https://arxiv.org/abs/1303.5778Google ScholarGoogle Scholar
  15. Joyce C Ho, Joydeep Ghosh, and Jimeng Sun 2014. Marble: High-Throughput Phenotyping from Electronic Health Records via Sparse Nonnegative Tensor Factorization. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 115--124.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Sepp Hochreiter and Jürgen Schmidhuber 1997. Long Short-Term Memory. Neural computation, Vol. 9, 8 (1997), 1735--1780. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Alistair EW Johnson, Tom J Pollard, Lu Shen, Li-wei H Lehman, Mengling Feng, Mohammad Ghassemi, Benjamin Moody, Peter Szolovits, Leo Anthony Celi, and Roger G Mark. 2016. MIMIC-III, A Freely Accessible Critical Care Database. Scientific Data Vol. 3 (2016).Google ScholarGoogle Scholar
  18. Uri Kartoun. 2016. A Methodology to Generate Virtual Patient Repositories. arXiv:1608.00570 [cs.CY] (2016). https://arxiv.org/ftp/arxiv/papers/1608/1608.00570.pdfGoogle ScholarGoogle Scholar
  19. Siwei Lai, Liheng Xu, Kang Liu, and Jun Zhao. 2015. Recurrent Convolutional Neural Networks for Text Classification Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence.Google ScholarGoogle Scholar
  20. Zachary C. Lipton, David C. Kale, Charles Elkan, and Randall" Wetzell 2016. Learning to Diagnose with LSTM Recurrent Neural Networks. arXiv:1511.03677v6 [cs.LG] (2016). https://arxiv.org/pdf/1511.03677v6.pdfGoogle ScholarGoogle Scholar
  21. Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schutze 2008. Introduction to Information Retrieval. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Benjamin M. Marlin, David C. Kale, Robinder G. Khemani, and Randall C. Wetzel 2012. Unsupervised Pattern Discovery in Electronic Health Care Data using Probabilistic Clustering Models Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium. ACM, 389--398.Google ScholarGoogle Scholar
  23. Trang Pham, Truyen Tran, Dinh Phung, and Svetha Vankatesh. 2016. DeepCare: A Deep Dynamic Memory Model for Predictive Medicine. arxiv:1602.00357v1 [stat.ML] (February 2016). https://arxiv.org/pdf/1602.00357v1.pdfGoogle ScholarGoogle Scholar
  24. Nitish Srivastava, Elman Mansimov, and Ruslan Salakhutdinov. 2016. Unsupervised Learning of Video Representations using LSTM. arXiv:1502.04681v3[cs.LG] (2016). https://arxiv.org/abs/1502.04681Google ScholarGoogle Scholar
  25. Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Pei-Hao Su, David Vandyke, and Steve Young. 2015. Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 1711--1721.Google ScholarGoogle Scholar
  26. Ting Xiang, Debajyoti Ray, Terry Lohrenz, Peter Dayan, and P Read Montague 2012. Computational Phenotyping of Two-person Interactions Reveals Differential Neural Response to Depth-of-thought. PLoS Comput Biol, Vol. 8, 12 (2012), e1002841. Google ScholarGoogle ScholarCross RefCross Ref
  27. Yu Zhang, I-Wei Wu, Duygu Tosun, Eric Foster, and Norbert Schuff 2016. Progression of Regional Microstructural Degeneration in Parkinson's Disease: A Multicenter Diffusion Tensor Imaging Study. PLOS ONE (2016).Google ScholarGoogle Scholar
  28. Jiayu Zhou, Zhaosong Lu, Jimeng Sun, Lei Yuan, Fei Wang, and Jieping Ye. 2013. Feafiner: Biomarker Identification from Medical Data Through Feature Generalization and Selection. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1034--1042. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Jiayu Zhou, Fei Wang, Jianying Hu, and Jieping Ye. 2014. From Micro to Macro: Data Driven Phenotyping by Densification of Longitudinal Electronic Medical Records. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 135--144. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Jiayu Zhou, Lei Yuan, Jun Liu, and Jieping Ye. 2017. A Multi-Task Learning Formulation for Predicting Disease Progression Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 814--822.Google ScholarGoogle Scholar
  31. Xiaoqiang Zhou, Baotian Hu, Qingcai Chen, and Xiaolong Wang. 2015. An Auto-Encoder for Learning Conversation Representation Using LSTM Proceedings of the 22nd International Conference on Neural Information Processing, ICONIP 2015. 310--317. http://dx.doi.org/10.1007/978-3-319-26532-2_34 Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Patient Subtyping via Time-Aware LSTM Networks

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
          August 2017
          2240 pages
          ISBN:9781450348874
          DOI:10.1145/3097983

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 4 August 2017

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          KDD '17 Paper Acceptance Rate64of748submissions,9%Overall Acceptance Rate1,133of8,635submissions,13%

          Upcoming Conference

          KDD '24

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader