
Data Mining and Knowledge Discovery

Published: 1 July 2015

Constrained elastic net based knowledge transfer for healthcare information exchange

Authors: Yan Li, Bhanukiran Vinzamuri, Chandan K. Reddy

Published in: Data Mining and Knowledge Discovery | Issue 4/2015


Abstract

Transfer learning methods have been applied successfully to a wide range of real-world problems, yet there have been almost no attempts to use them effectively in healthcare applications. In the healthcare domain, it is critical to resolve the “when to transfer” question: when the source and target domains are highly divergent, transfer learning can lead to negative transfer. Most existing work on transfer learning focuses on selecting useful information from the source to improve performance on the target task; whether transfer learning can help, and when it should be applied to the target task, remain open challenges. In this paper, we address the “when to transfer” question by proposing a sparse feature selection model based on the constrained elastic net penalty. As a case study, we evaluate the proposed model on diabetes electronic health records (EHRs) containing patient records from all fifty states of the United States. Our approach selects relevant features for transferring knowledge from the source to the target tasks. The proposed model measures the differences between multivariate data distributions conditional on the predicted model, and this measurement lets us avoid unsuccessful transfer. We successfully transfer knowledge across states: for a state with too few records to build an individualized predictive model, information from other states improves the diagnosis of diabetes.
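To make the sparse feature selection idea concrete, the following is a minimal sketch of the standard, unconstrained elastic net (Zou and Hastie 2005), fitted by cyclic coordinate descent in the style of Friedman et al. (2010). It is not the constrained formulation proposed in the paper, which the abstract does not spell out. The function names, the synthetic data, and the values of `lam` and `alpha` are all assumptions chosen for illustration.

```python
import random


def soft_threshold(z, gamma):
    """Soft-thresholding operator S(z, gamma) used in lasso-type updates."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0


def elastic_net_cd(X, y, lam=0.1, alpha=0.5, n_iter=200):
    """Cyclic coordinate descent for the (unconstrained) elastic net.

    Minimizes (1/2n)||y - Xb||^2 + lam*(alpha*||b||_1 + (1-alpha)/2*||b||_2^2).
    X is a list of rows, y a list of targets; no intercept is fitted.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - X beta, starting from beta = 0
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            # Correlation of feature j with the residual that excludes beta_j.
            rho = sum(X[i][j] * (resid[i] + X[i][j] * beta[j])
                      for i in range(n)) / n
            new_bj = (soft_threshold(rho, lam * alpha)
                      / (col_sq[j] + lam * (1.0 - alpha)))
            delta = new_bj - beta[j]
            if delta != 0.0:
                # Keep the residual in sync with the updated coefficient.
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new_bj
    return beta


def make_data(n=200, seed=0):
    """Hypothetical synthetic data: only the first two features matter."""
    rng = random.Random(seed)
    true_beta = [2.0, -1.5, 0.0, 0.0, 0.0]
    X = [[rng.gauss(0, 1) for _ in true_beta] for _ in range(n)]
    y = [sum(x * b for x, b in zip(row, true_beta)) + rng.gauss(0, 0.1)
         for row in X]
    return X, y


X, y = make_data()
beta = elastic_net_cd(X, y, lam=0.1, alpha=0.9)
# Features whose coefficient is not exactly zero count as "selected".
selected = [j for j, b in enumerate(beta) if b != 0.0]
```

The \(L_1\) part of the penalty is what drives coefficients exactly to zero and so performs feature selection, while the \(L_2\) part stabilizes the fit when features are correlated. In a transfer setting, one could compare which features are selected on source and target data, but that comparison is an illustration here rather than the paper's procedure.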


Although Zhou et al. (2012) name it multi-task Lasso, its optimization formulation uses both \(L_1\)-norm and \(L_2\)-norm penalties.

Arnold A, Nallapati R, Cohen WW (2007) A comparative study of methods for transductive transfer learning. In: Seventh IEEE international conference on data mining workshops, 2007. ICDM Workshops 2007, p 77–82

Blitzer J, Dredze M, Pereira F (2007) Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. ACL 7:440–447

Caruana R (1997) Multitask learning. Mach Learn 28(1):41–75

Dai W, Yang Q, Xue G, Yu Y (2007) Boosting for transfer learning. In: ICML’07: Proceedings of the 24th international conference on Machine learning, p 193–200

Dai W, Yang Q, Xue GR, Yu Y (2008) Self-taught clustering. In: Proceedings of the 25th international conference on machine learning, ACM, p 200–207

Donoho DL, Johnstone JM (1994) Ideal spatial adaptation by wavelet shrinkage. Biometrika 81(3):425–455

Evgeniou A, Pontil M (2007) Multi-task feature learning. In: Proceedings of the 2006 conference on advances in neural information processing systems, vol. 19. The MIT Press, Cambridge, p 41

Evgeniou T, Pontil M (2004) Regularized multi-task learning. In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, p 109–117

Farhadi A, Forsyth D, White R (2007) Transfer learning in sign language. In: IEEE Conference on computer vision and pattern recognition, CVPR’07, IEEE, p 1–8

Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33(1):1–22

Fung GPC, Yu JX, Lu H, Yu PS (2006) Text classification without negative examples revisit. IEEE Trans Knowl Data Eng 18(1):6–20

Hastie T, Tibshirani R, Friedman JJH (2001) The elements of statistical learning. Springer, New York

Liu J, Ji S, Ye J (2009) Multi-task feature learning via efficient \(\ell_{2,1}\)-norm minimization. In: Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence. AUAI Press, Corvallis, p 339–348

Mihalkova L, Mooney RJ (2008) Transfer learning by mapping with minimal target data. In: Proceedings of the AAAI-08 workshop on transfer learning for complex tasks

Pan J (2010) Feature-based transfer learning with real-world applications. Ph.D. thesis, The Hong Kong University of Science and Technology

Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359

Pan SJ, Zheng VW, Yang Q, Hu DH (2008) Transfer learning for wifi-based indoor localization. In: Association for the advancement of artificial intelligence (AAAI) workshop, p 6

Practice Fusion Diabetes Classification: Identify patients diagnosed with Type 2 Diabetes (2012). https://www.kaggle.com/c/pf2012-diabetes

Raina R, Battle A, Lee H, Packer B, Ng AY (2007) Self-taught learning: transfer learning from unlabeled data. In: Proceedings of the 24th international conference on Machine learning, ACM, p 759–766

Rosenstein MT, Marx Z, Kaelbling LP, Dietterich TG (2005) To transfer or not to transfer. In: NIPS 2005 workshop on inductive transfer: 10 years later, vol. 2, p 7

Rückert U, Kramer S (2008) Kernel-based inductive transfer. In: Machine learning and knowledge discovery in databases. Springer, Heidelberg, pp 220–233

Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc B 58(1):267–288

Tseng P (2001) Convergence of a block coordinate descent method for nondifferentiable minimization. J Optim Theory Appl 109(3):475–494

Ye J, Liu J (2012) Sparse methods for biomedical data. ACM SIGKDD Explor Newslett 14(1):4–15

Zhou J, Chen J, Ye J (2012) Malsar: multi-task learning via structural regularization. Arizona State University, Phoenix

Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc B 67(2):301–320

Title: Constrained elastic net based knowledge transfer for healthcare information exchange
Authors: Yan Li, Bhanukiran Vinzamuri, Chandan K. Reddy
Publication date: 1 July 2015
Publisher: Springer US
Published in: Data Mining and Knowledge Discovery / Issue 4/2015
Print ISSN: 1384-5810
Electronic ISSN: 1573-756X
DOI: https://doi.org/10.1007/s10618-014-0389-3
