Skip to main content
Top

2021 | OriginalPaper | Chapter

Using Causal Analysis for Conceptual Deep Learning Explanation

Authors : Sumedha Singla, Stephen Wallace, Sofia Triantafillou, Kayhan Batmanghelich

Published in: Medical Image Computing and Computer Assisted Intervention – MICCAI 2021

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Model explainability is essential for the creation of trustworthy Machine Learning models in healthcare. An ideal explanation resembles the decision-making process of a domain expert and is expressed using concepts or terminology that is meaningful to the clinicians. To provide such explanation, we first associate the hidden units of the classifier to clinically relevant concepts. We take advantage of radiology reports accompanying the chest X-ray images to define concepts. We discover sparse associations between concepts and hidden units using a linear sparse logistic regression. To ensure that the identified units truly influence the classifier’s outcome, we adopt tools from Causal Inference literature and, more specifically, mediation analysis through counterfactual interventions. Finally, we construct a low-depth decision tree to translate all the discovered concepts into a straightforward decision rule, expressed to the radiologist. We evaluated our approach on a large chest x-ray dataset, where our model produces a global explanation consistent with clinical knowledge.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Basu, S., Mitra, S., Saha, N.: Deep learning for screening COVID-19 using chest X-ray images. In: IEEE Symposium Series on Computational Intelligence (SSCI) (2020) Basu, S., Mitra, S., Saha, N.: Deep learning for screening COVID-19 using chest X-ray images. In: IEEE Symposium Series on Computational Intelligence (SSCI) (2020)
2.
go back to reference Bau, D., Zhou, B., Khosla, A., Oliva, A., Torralba, A.: Network dissection: quantifying interpretability of deep visual representations. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 6541–6549 (2017) Bau, D., Zhou, B., Khosla, A., Oliva, A., Torralba, A.: Network dissection: quantifying interpretability of deep visual representations. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 6541–6549 (2017)
3.
go back to reference Bau, D., Zhu, J.Y., Strobelt, H., Lapedriza, A., Zhou, B., Torralba, A.: Understanding the role of individual units in a deep neural network. Nat. Acad. Sci. 117(48), 30071–30078 (2020)CrossRef Bau, D., Zhu, J.Y., Strobelt, H., Lapedriza, A., Zhou, B., Torralba, A.: Understanding the role of individual units in a deep neural network. Nat. Acad. Sci. 117(48), 30071–30078 (2020)CrossRef
4.
go back to reference Clough, J.R., Oksuz, I., Puyol-Antón, E., Ruijsink, B., King, A.P., Schnabel, J.A.: Global and local interpretability for cardiac MRI classification. In: Medical Image Computing and Computer-Assisted Intervention (MICCAI), pp. 656–664 (2019) Clough, J.R., Oksuz, I., Puyol-Antón, E., Ruijsink, B., King, A.P., Schnabel, J.A.: Global and local interpretability for cardiac MRI classification. In: Medical Image Computing and Computer-Assisted Intervention (MICCAI), pp. 656–664 (2019)
5.
go back to reference Glass, A., McGuinness, D.L., Wolverton, M.: Toward establishing trust in adaptive agents. In: International Conference on Intelligent User Interfaces (2008) Glass, A., McGuinness, D.L., Wolverton, M.: Toward establishing trust in adaptive agents. In: International Conference on Intelligent User Interfaces (2008)
6.
go back to reference Graziani, M., Andrearczyk, V., Marchand-Maillet, S., Müller, H.: Concept attribution: explaining CNN decisions to physicians. Comput. Biol. Med. 123, 103865 (2020)CrossRef Graziani, M., Andrearczyk, V., Marchand-Maillet, S., Müller, H.: Concept attribution: explaining CNN decisions to physicians. Comput. Biol. Med. 123, 103865 (2020)CrossRef
7.
go back to reference Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 4700–4708 (2017) Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 4700–4708 (2017)
8.
go back to reference Imai, K., Jo, B., Stuart, E.A.: Commentary: using potential outcomes to understand causal mediation analysis. Multivar. Behav. Res. 46(5), 861–873 (2011)CrossRef Imai, K., Jo, B., Stuart, E.A.: Commentary: using potential outcomes to understand causal mediation analysis. Multivar. Behav. Res. 46(5), 861–873 (2011)CrossRef
9.
go back to reference Irvin, J., et al.: Chexpert: a large chest radiograph dataset with uncertainty labels and expert comparison. AAAI Conf. Artif. Intell. 33, 590–597 (2019) Irvin, J., et al.: Chexpert: a large chest radiograph dataset with uncertainty labels and expert comparison. AAAI Conf. Artif. Intell. 33, 590–597 (2019)
10.
go back to reference Johnson, A.E., et al.: Mimic-CXR, a de-identified publicly available database of chest radiographs with free-text reports. Sci. Data 6(1), 1–8 (2019)CrossRef Johnson, A.E., et al.: Mimic-CXR, a de-identified publicly available database of chest radiographs with free-text reports. Sci. Data 6(1), 1–8 (2019)CrossRef
11.
go back to reference Karkhanis, V.S., Joshi, J.M.: Pleural effusion: diagnosis, treatment, and management. Open Access Emerg. Med. (OAEM) 4, 31 (2012)CrossRef Karkhanis, V.S., Joshi, J.M.: Pleural effusion: diagnosis, treatment, and management. Open Access Emerg. Med. (OAEM) 4, 31 (2012)CrossRef
12.
go back to reference Kim, B., Wattenberg, M., Gilmer, J., Cai, C., Wexler, J., Viegas, F., et al.: Interpretability beyond feature attribution: quantitative testing with concept activation vectors (TCAV). In: International Conference on Machine Learning (ICML), pp. 2668–2677 (2018) Kim, B., Wattenberg, M., Gilmer, J., Cai, C., Wexler, J., Viegas, F., et al.: Interpretability beyond feature attribution: quantitative testing with concept activation vectors (TCAV). In: International Conference on Machine Learning (ICML), pp. 2668–2677 (2018)
13.
go back to reference Lundberg, S.M., Lee, S.I.: A unified approach to interpreting model predictions. Adv. Neural. Inf. Process. Syst. 30, 4765–4774 (2017) Lundberg, S.M., Lee, S.I.: A unified approach to interpreting model predictions. Adv. Neural. Inf. Process. Syst. 30, 4765–4774 (2017)
14.
go back to reference Milne, E., Pistolesi, M., Miniati, M., Giuntini, C.: The radiologic distinction of cardiogenic and noncardiogenic edema. Am. J. Roentgenol. 144(5), 879–894 (1985)CrossRef Milne, E., Pistolesi, M., Miniati, M., Giuntini, C.: The radiologic distinction of cardiogenic and noncardiogenic edema. Am. J. Roentgenol. 144(5), 879–894 (1985)CrossRef
15.
go back to reference Nakamori, N., MacMahon, H., Sasaki, Y., Montner, S., et al.: Effect of heart-size parameters computed from digital chest radiographs on detection of cardiomegaly. potential usefulness for computer-aided diagnosis. Invest. Radiol. 26(6), 546–550 (1991) Nakamori, N., MacMahon, H., Sasaki, Y., Montner, S., et al.: Effect of heart-size parameters computed from digital chest radiographs on detection of cardiomegaly. potential usefulness for computer-aided diagnosis. Invest. Radiol. 26(6), 546–550 (1991)
16.
go back to reference Pearl, J.: Direct and indirect effects. In: Conference on Uncertainty and Artificial Intelligence (UAI), pp. 411–420 (2001) Pearl, J.: Direct and indirect effects. In: Conference on Uncertainty and Artificial Intelligence (UAI), pp. 411–420 (2001)
17.
go back to reference Ribeiro, M.T., Singh, S., Guestrin, C.: Why should i trust you? explaining the predictions of any classifier. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144 (2016) Ribeiro, M.T., Singh, S., Guestrin, C.: Why should i trust you? explaining the predictions of any classifier. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144 (2016)
18.
go back to reference Rubin, D.B.: Estimating causal effects of treatments in randomized and nonrandomized studies. J. Educ. Psychol. 66(5), 688 (1974)CrossRef Rubin, D.B.: Estimating causal effects of treatments in randomized and nonrandomized studies. J. Educ. Psychol. 66(5), 688 (1974)CrossRef
19.
go back to reference Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-cam: Visual explanations from deep networks via gradient-based localization. In: International Conference on Computer Vision (ICCV), pp. 618–626 (2017) Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-cam: Visual explanations from deep networks via gradient-based localization. In: International Conference on Computer Vision (ICCV), pp. 618–626 (2017)
20.
go back to reference Singla, S., Pollack, B., Chen, J., Batmanghelich, K.: Explanation by progressive exaggeration. In: International Conference on Learning Representations (ICLR) (2019) Singla, S., Pollack, B., Chen, J., Batmanghelich, K.: Explanation by progressive exaggeration. In: International Conference on Learning Representations (ICLR) (2019)
21.
go back to reference Vig, J., et al.: Investigating gender bias in language models using causal mediation analysis. In: Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M.F., Lin, H. (eds.) Advances in Neural Information Processing Systems, vol. 33, pp. 12388–12401 (2020) Vig, J., et al.: Investigating gender bias in language models using causal mediation analysis. In: Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M.F., Lin, H. (eds.) Advances in Neural Information Processing Systems, vol. 33, pp. 12388–12401 (2020)
23.
go back to reference Zhou, B., Sun, Y., Bau, D., Torralba, A.: Interpretable basis decomposition for visual explanation. In: European Conference on Computer Vision (ECCV), pp. 119–134 (2018) Zhou, B., Sun, Y., Bau, D., Torralba, A.: Interpretable basis decomposition for visual explanation. In: European Conference on Computer Vision (ECCV), pp. 119–134 (2018)
Metadata
Title
Using Causal Analysis for Conceptual Deep Learning Explanation
Authors
Sumedha Singla
Stephen Wallace
Sofia Triantafillou
Kayhan Batmanghelich
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-87199-4_49

Premium Partner