2021 | OriginalPaper | Book chapter

TransICD: Transformer Based Code-Wise Attention Model for Explainable ICD Coding

Authors: Biplob Biswas, Thai-Hoang Pham, Ping Zhang

Published in: Artificial Intelligence in Medicine

Publisher: Springer International Publishing

Abstract

International Classification of Diseases (ICD) coding, the task of tagging medical notes with diagnosis codes, has been shown to be effective and crucial to billing systems in the medical sector. Currently, ICD codes are assigned to clinical notes manually, which is error-prone. Moreover, training skilled coders requires time and human resources. Automating ICD code assignment is therefore an important task. With advances in artificial intelligence theory and computational hardware, machine learning has emerged as a suitable way to automate this process. In this project, we apply a transformer-based architecture to capture the interdependence among the tokens of a document and then use a code-wise attention mechanism to learn code-specific representations of the entire document. These representations are fed to separate dense layers for the corresponding code predictions. Furthermore, to handle the imbalance in code frequency in clinical datasets, we employ a label-distribution-aware margin (LDAM) loss function. Experimental results on the MIMIC-III dataset show that our proposed model outperforms the other baselines by a significant margin. In particular, our best setting achieves a micro-AUC score of 0.923, compared with 0.868 for bidirectional recurrent neural networks. We also show that the code-wise attention mechanism lets the model provide more insight into its predictions, which can help clinicians make reliable decisions. Our code is available online (https://github.com/biplob1ly/TransICD).
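To make the two central ideas of the abstract concrete, the following is a minimal sketch in plain Python. It is not taken from the authors' repository. It assumes that code-wise attention scores each token against one learned query vector per code, and that the LDAM loss is adapted to the multi-label setting by subtracting a per-code margin from the logits of positive labels before a binary cross-entropy. The margin scaling (proportional to n^(-1/4), rescaled so the rarest code receives `max_margin`) follows the general LDAM formulation. All function names, parameter names, and toy values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def code_wise_attention(hidden, code_queries):
    """Build one attention-weighted document vector per code.

    hidden:       n_tokens x d token representations (e.g. encoder outputs)
    code_queries: n_codes x d query vectors, one per code (assumed form)
    Returns (doc_vectors, attn_weights); the weights can be inspected to
    see which tokens contributed to each code's representation.
    """
    dim = len(hidden[0])
    doc_vectors, attn_weights = [], []
    for query in code_queries:
        # Dot-product relevance of every token to this code.
        scores = [sum(q * h for q, h in zip(query, token)) for token in hidden]
        weights = softmax(scores)
        # Weighted sum of token vectors gives the code-specific document vector.
        vec = [sum(w * token[k] for w, token in zip(weights, hidden))
               for k in range(dim)]
        doc_vectors.append(vec)
        attn_weights.append(weights)
    return doc_vectors, attn_weights


def ldam_margins(code_counts, max_margin=0.5):
    """Per-code margins proportional to n^(-1/4); counts must be positive.

    The margins are rescaled so the rarest code gets exactly max_margin.
    """
    raw = [n ** -0.25 for n in code_counts]
    scale = max_margin / max(raw)
    return [r * scale for r in raw]


def ldam_bce_loss(logits, labels, margins):
    """Mean binary cross-entropy with the margin subtracted from positive logits.

    This multi-label adaptation is an assumption made for illustration.
    """
    total = 0.0
    for z, y, m in zip(logits, labels, margins):
        z_adj = z - m if y == 1 else z
        p = 1.0 / (1.0 + math.exp(-z_adj))
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(logits)


# Toy usage: three tokens, two codes, hidden size 2 (all values made up).
hidden = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
code_queries = [[2.0, 0.0], [0.0, 2.0]]
doc_vectors, attn_weights = code_wise_attention(hidden, code_queries)

margins = ldam_margins(code_counts=[1, 16])
loss = ldam_bce_loss(logits=[2.0, -1.0], labels=[1, 0], margins=margins)
```

In this sketch, rare codes receive larger margins, so a positive prediction for a rare code must clear a higher bar before its loss becomes small. The returned attention weights are the quantity one would inspect for the kind of per-code explanation the abstract describes.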

Print ISBN: 978-3-030-77210-9

Electronic ISBN: 978-3-030-77211-6

Copyright year: 2021
DOI: https://doi.org/10.1007/978-3-030-77211-6_56
