2021 | OriginalPaper | Book chapter

TransICD: Transformer Based Code-Wise Attention Model for Explainable ICD Coding

Authors: Biplob Biswas, Thai-Hoang Pham, Ping Zhang

Published in: Artificial Intelligence in Medicine

Publisher: Springer International Publishing

Abstract

International Classification of Diseases (ICD) coding, the procedure of tagging medical notes with diagnosis codes, has been shown to be effective and crucial to the billing system in the medical sector. Currently, ICD codes are assigned to a clinical note manually, which is likely to cause many errors. Moreover, training skilled coders requires time and human resources. Therefore, automating the ICD code determination process is an important task. With the advancement of artificial intelligence theory and computational hardware, machine learning has emerged as a suitable approach to automating this process. In this project, we apply a transformer-based architecture to capture the interdependence among the tokens of a document and then use a code-wise attention mechanism to learn code-specific representations of the entire document. These representations are then fed to separate dense layers, one per code, for the corresponding code prediction. Furthermore, to handle the imbalance in the code frequency of clinical datasets, we employ a label-distribution-aware margin (LDAM) loss function. The experimental results on the MIMIC-III dataset show that our proposed model outperforms other baselines by a significant margin. In particular, our best setting achieves a micro-AUC score of 0.923, compared to 0.868 for bidirectional recurrent neural networks. We also show that, by using the code-wise attention mechanism, the model can provide more insight into its predictions and can thus support clinicians in making reliable decisions. Our code is available online (https://github.com/biplob1ly/TransICD).
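The pipeline described in the abstract (token representations, code-wise attention, one dense layer per code, and a frequency-aware margin) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the attention formula, so plain dot-product scoring against one query vector per code is assumed here, and the function names `code_wise_attention`, `predict_codes`, and `ldam_margins`, the toy inputs, and the constant `c` are all hypothetical. The margin follows the general form from the LDAM paper (margin proportional to n^(-1/4) for a label with n training examples); how TransICD plugs it into its loss is not stated in the abstract.

```python
import math


def dot(a, b):
    """Dot product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def code_wise_attention(hidden, code_queries):
    """Build one document representation per code.

    hidden:       n_tokens x d token vectors (e.g. encoder outputs)
    code_queries: n_codes x d query vectors, one per code (assumed form)
    Returns (weights, vectors): attention weights (n_codes x n_tokens)
    and code-specific document vectors (n_codes x d).
    """
    weights, vectors = [], []
    for q in code_queries:
        a = softmax([dot(h, q) for h in hidden])  # attention over tokens
        v = [sum(a_i * h[k] for a_i, h in zip(a, hidden))
             for k in range(len(q))]              # weighted sum of tokens
        weights.append(a)
        vectors.append(v)
    return weights, vectors


def predict_codes(code_vectors, dense_w, dense_b):
    """Apply a separate dense layer (w, b) per code, then a sigmoid."""
    return [1.0 / (1.0 + math.exp(-(dot(w, v) + b)))
            for v, w, b in zip(code_vectors, dense_w, dense_b)]


def ldam_margins(code_counts, c=0.5):
    """Per-code margin c / n^(1/4): rarer codes get larger margins."""
    return [c * n ** -0.25 for n in code_counts]


# Toy example: 3 tokens, hidden size 2, 2 codes (all values made up).
hidden = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
code_queries = [[2.0, 0.0], [0.0, 2.0]]
weights, vectors = code_wise_attention(hidden, code_queries)
probs = predict_codes(vectors, [[1.0, -1.0], [0.5, 0.5]], [0.0, -0.5])
margins = ldam_margins([1000, 10])
```

Each row of `weights` is a distribution over tokens for one code, which is what makes per-code explanations possible: the highest-weighted tokens indicate which parts of the note drove that code's prediction.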

Metadata
Title
TransICD: Transformer Based Code-Wise Attention Model for Explainable ICD Coding
Authors
Biplob Biswas
Thai-Hoang Pham
Ping Zhang
Copyright year
2021
DOI
https://doi.org/10.1007/978-3-030-77211-6_56