Patient Subtyping via Time-Aware LSTM Networks

Authors:
Inci M. Baytas, Michigan State University, East Lansing, MI, USA
Cao Xiao, IBM T. J. Watson Research Center, Yorktown Heights, NY, USA
Xi Zhang, Cornell University, New York, NY, USA
Fei Wang, Cornell University, New York, NY, USA
Anil K. Jain, Michigan State University, East Lansing, MI, USA
Jiayu Zhou, Michigan State University, East Lansing, MI, USA

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2017, Pages 65–74. https://doi.org/10.1145/3097983.3097997

Published: 04 August 2017

ABSTRACT

In the study of various diseases, heterogeneity among patients usually leads to different progression patterns and may require different types of therapeutic intervention. It is therefore important to study patient subtyping, the grouping of patients into disease-characterizing subtypes. Subtyping from complex patient data is challenging because of information heterogeneity and temporal dynamics. Long Short-Term Memory (LSTM) has been successfully used in many domains for processing sequential data and has recently been applied to analyzing longitudinal patient records. LSTM units are designed to handle data with constant elapsed times between consecutive elements of a sequence. Because the time lapse between successive elements in patient records can vary from days to months, the design of the traditional LSTM may lead to suboptimal performance. In this paper, we propose a novel LSTM unit called Time-Aware LSTM (T-LSTM) to handle irregular time intervals in longitudinal patient records. We learn a subspace decomposition of the cell memory, which enables time decay to discount the memory content according to the elapsed time. We propose a patient subtyping model that uses the proposed T-LSTM in an auto-encoder to learn a single, powerful representation of each patient's sequential records; these representations are then used to cluster patients into clinical subtypes. Experiments on synthetic and real-world datasets show that the proposed T-LSTM architecture captures the underlying structures in sequences with time irregularities.
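The abstract describes the memory adjustment only in words, so the following is a minimal sketch of one way such a time-decayed discount could look, not the authors' implementation. It assumes the previous cell memory is split into a short-term part (a learned projection passed through tanh) and a long-term remainder, and that only the short-term part is scaled by a decreasing function of the elapsed time. The names `decay`, `adjust_memory`, `w_d`, and `b_d`, and the specific decay form 1 / log(e + Δt), are illustrative assumptions.

```python
import math


def decay(delta_t):
    """Decreasing weight for an elapsed time delta_t >= 0 (assumed form)."""
    return 1.0 / math.log(math.e + delta_t)


def adjust_memory(c_prev, w_d, b_d, delta_t):
    """Discount the short-term part of the previous cell memory.

    c_prev  : list of floats, previous cell memory
    w_d     : square matrix as a list of rows (hypothetical learned weights)
    b_d     : list of floats (hypothetical learned bias)
    delta_t : elapsed time since the previous record
    """
    # Short-term component: a learned projection of the previous memory.
    c_short = [
        math.tanh(sum(w * c for w, c in zip(row, c_prev)) + b)
        for row, b in zip(w_d, b_d)
    ]
    # Long-term component: what remains after removing the short-term part.
    c_long = [c - s for c, s in zip(c_prev, c_short)]
    # Only the short-term component is discounted by the elapsed time.
    g = decay(delta_t)
    return [lt + g * st for lt, st in zip(c_long, c_short)]


# Usage: the same memory adjusted for a short gap and for a long gap.
identity = [[1.0, 0.0], [0.0, 1.0]]
zero_bias = [0.0, 0.0]
memory = [0.5, -1.0]
print(adjust_memory(memory, identity, zero_bias, delta_t=1.0))
print(adjust_memory(memory, identity, zero_bias, delta_t=30.0))
```

Under these assumptions, a gap of zero leaves the memory unchanged, and longer gaps move the adjusted memory toward its long-term component. The adjusted memory would then take the place of the previous cell memory in an otherwise standard LSTM update.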

Published in
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 2017
2240 pages
ISBN: 9781450348874
DOI: 10.1145/3097983
General Chairs: Stan Matwin (Dalhousie University), Shipeng Yu (LinkedIn), Faisal Farooq (IBM)
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
long-short term memory
patient subtyping
recurrent neural network
Qualifiers
- research-article
Acceptance Rates
KDD '17 Paper Acceptance Rate: 64 of 748 submissions, 9%
Overall Acceptance Rate: 1,133 of 8,635 submissions, 13%

Article Metrics
- Total Citations: 327
- Total Downloads: 9,071
- Downloads (Last 12 months): 1,458
- Downloads (Last 6 weeks): 196