Skip to main content
Erschienen in: Information Systems Frontiers 1/2022

22.05.2021

Coalitional Strategies for Efficient Individual Prediction Explanation

verfasst von: Gabriel Ferrettini, Elodie Escriva, Julien Aligon, Jean-Baptiste Excoffier, Chantal Soulé-Dupuy

Erschienen in: Information Systems Frontiers | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

As Machine Learning (ML) is now widely applied in many domains, in both research and industry, an understanding of what is happening inside the black box is becoming a growing demand, especially by non-experts of these models. Several approaches had thus been developed to provide clear insights of a model prediction for a particular observation but at the cost of long computation time or restrictive hypothesis that does not fully take into account interaction between attributes. This paper provides methods based on the detection of relevant groups of attributes -named coalitions- influencing a prediction and compares them with the literature. Our results show that these coalitional methods are more efficient than existing ones such as SHapley Additive exPlanation (SHAP). Computation time is shortened while preserving an acceptable accuracy of individual prediction explanations. Therefore, this enables wider practical use of explanation methods to increase trust between developed ML models, end-users, and whoever impacted by any decision where these models played a role.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Clinical app that predicts an aggravation risk for a patient hospitalized with Covid-19. Attribute influences are computed with SHAP. https://​scorecovid.​kaduceo.​com/​
 
Literatur
Zurück zum Zitat Adadi, A., & Berrada, M. (2018). Peeking inside the black-box: A survey on explainable artificial intelligence (xai). IEEE Access, 6, 52138–52160.CrossRef Adadi, A., & Berrada, M. (2018). Peeking inside the black-box: A survey on explainable artificial intelligence (xai). IEEE Access, 6, 52138–52160.CrossRef
Zurück zum Zitat Altmann, A., Toloşi, L., Sander, O., & Lengauer, T. (2010). Permutation importance: a corrected feature importance measure. Bioinformatics, 26(10), 1340–1347.CrossRef Altmann, A., Toloşi, L., Sander, O., & Lengauer, T. (2010). Permutation importance: a corrected feature importance measure. Bioinformatics, 26(10), 1340–1347.CrossRef
Zurück zum Zitat Carvalho, D. V., Pereira, E. M., & Cardoso, J. S. (2019). Machine learning interpretability: A survey on methods and metrics. Electronics, 8(8), 832.CrossRef Carvalho, D. V., Pereira, E. M., & Cardoso, J. S. (2019). Machine learning interpretability: A survey on methods and metrics. Electronics, 8(8), 832.CrossRef
Zurück zum Zitat Casalicchio, G., Molnar, C., & Bischl, B. (2018). Visualizing the feature importance for black box models. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases (pp. 655–670): Springer. Casalicchio, G., Molnar, C., & Bischl, B. (2018). Visualizing the feature importance for black box models. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases (pp. 655–670): Springer.
Zurück zum Zitat Datta, A, Sen, S, & Zick, Y. (2016). Algorithmic transparency via quantitative input influence: Theory and experiments with learning systems. In 2016 IEEE Symposium on Security and Privacy (SP) (pp. 598–617). Datta, A, Sen, S, & Zick, Y. (2016). Algorithmic transparency via quantitative input influence: Theory and experiments with learning systems. In 2016 IEEE Symposium on Security and Privacy (SP) (pp. 598–617).
Zurück zum Zitat den Broeck, G. V., Lykov, A., Schleich, M., & Suciu, D. (2020). On the tractability of shap explanations. den Broeck, G. V., Lykov, A., Schleich, M., & Suciu, D. (2020). On the tractability of shap explanations.
Zurück zum Zitat ElShawi, R., Sherif, Y., Al-Mallah, M., & Sakr, S. (2020). Interpretability in healthcare: A comparative study of local machine learning interpretability techniques. Computational Intelligence. ElShawi, R., Sherif, Y., Al-Mallah, M., & Sakr, S. (2020). Interpretability in healthcare: A comparative study of local machine learning interpretability techniques. Computational Intelligence.
Zurück zum Zitat Ferrettini, G., Aligon, J., & Soulé-Dupuy, C. (2020a). Explaining single predictions: A faster method. In Chatzigeorgiou, A., Dondi, R., Herodotou, H., Kapoutsis, C., Manolopoulos, Y., Papadopoulos, G.A., & Sikora, F. (Eds.) SOFSEM 2020: Theory and Practice of Computer Science (pp. 313–324). Cham: Springer International Publishing. Ferrettini, G., Aligon, J., & Soulé-Dupuy, C. (2020a). Explaining single predictions: A faster method. In Chatzigeorgiou, A., Dondi, R., Herodotou, H., Kapoutsis, C., Manolopoulos, Y., Papadopoulos, G.A., & Sikora, F. (Eds.) SOFSEM 2020: Theory and Practice of Computer Science (pp. 313–324). Cham: Springer International Publishing.
Zurück zum Zitat Ferrettini, G., Aligon, J., & Soulé-Dupuy, C. (2020b). Improving on coalitional prediction explanation. In Darmont, J., Novikov, B., & Wrembel, R. (Eds.) Advances in Databases and Information Systems - 24th European Conference, ADBIS 2020, Proceedings, Lecture Notes in Computer Science. https://doi.org/10.1007/978-3-030-54832-2_11, (Vol. 12245 pp. 122–135). Lyon: Springer. Ferrettini, G., Aligon, J., & Soulé-Dupuy, C. (2020b). Improving on coalitional prediction explanation. In Darmont, J., Novikov, B., & Wrembel, R. (Eds.) Advances in Databases and Information Systems - 24th European Conference, ADBIS 2020, Proceedings, Lecture Notes in Computer Science. https://​doi.​org/​10.​1007/​978-3-030-54832-2_​11, (Vol. 12245 pp. 122–135). Lyon: Springer.
Zurück zum Zitat Francone, M., Iafrate, F., Masci, G. M., Coco, S., Cilia, F., Manganaro, L., Panebianco, V., Andreoli, C., Colaiacomo, M. C., Zingaropoli, M. A., & et al. (2020). Chest ct score in covid-19 patients: correlation with disease severity and short-term prognosis. European Radiology, 30(12), 6808–6817.CrossRef Francone, M., Iafrate, F., Masci, G. M., Coco, S., Cilia, F., Manganaro, L., Panebianco, V., Andreoli, C., Colaiacomo, M. C., Zingaropoli, M. A., & et al. (2020). Chest ct score in covid-19 patients: correlation with disease severity and short-term prognosis. European Radiology, 30(12), 6808–6817.CrossRef
Zurück zum Zitat Hall, M. A. (1999). Correlation-based feature selection for machine learning. Ph.D. Thesis. Hall, M. A. (1999). Correlation-based feature selection for machine learning. Ph.D. Thesis.
Zurück zum Zitat Henelius, A., Puolamäki, K., & Ukkonen, A. (2017). Interpreting classifiers through attribute interactions in datasets. arXiv:1707.07576. Henelius, A., Puolamäki, K., & Ukkonen, A. (2017). Interpreting classifiers through attribute interactions in datasets. arXiv:1707.​07576.
Zurück zum Zitat Henelius, A., Puolamaki, K., Boström, H., Asker, L., & Papapetrou, P. (2014). A peek into the black box : exploring classifiers by randomization. Data Mining and Knowledge Discovery, 28(5-6), 1503–1529. QC 20180119.CrossRef Henelius, A., Puolamaki, K., Boström, H., Asker, L., & Papapetrou, P. (2014). A peek into the black box : exploring classifiers by randomization. Data Mining and Knowledge Discovery, 28(5-6), 1503–1529. QC 20180119.CrossRef
Zurück zum Zitat Kira, K., & Rendell, L. A. (1992). A practical approach to feature selection. In Machine Learning Proceedings 1992 (pp. 249–256): Elsevier. Kira, K., & Rendell, L. A. (1992). A practical approach to feature selection. In Machine Learning Proceedings 1992 (pp. 249–256): Elsevier.
Zurück zum Zitat Lauritsen, S. M., Kristensen, M., Olsen, M. V., Larsen, M. S., Lauritsen, K. M., Jørgensen, M.J., Lange, J., & Thiesson, B. (2020). Explainable artificial intelligence model to predict acute critical illness from electronic health records. Nature Communications, 11(1), 1– 11.CrossRef Lauritsen, S. M., Kristensen, M., Olsen, M. V., Larsen, M. S., Lauritsen, K. M., Jørgensen, M.J., Lange, J., & Thiesson, B. (2020). Explainable artificial intelligence model to predict acute critical illness from electronic health records. Nature Communications, 11(1), 1– 11.CrossRef
Zurück zum Zitat Lundberg, S. M., & Lee, S-I. (2017a). Consistent feature attribution for tree ensembles. arXiv:1706.06060. Lundberg, S. M., & Lee, S-I. (2017a). Consistent feature attribution for tree ensembles. arXiv:1706.​06060.
Zurück zum Zitat Makki, S. (2019). An efficient classification model for analyzing skewed data to detect frauds in the financial sector. Ph.D. Thesis, Université de Lyon; Université libanaise. Makki, S. (2019). An efficient classification model for analyzing skewed data to detect frauds in the financial sector. Ph.D. Thesis, Université de Lyon; Université libanaise.
Zurück zum Zitat Mejía-Lavalle, M, Sucar, E., & Arroyo, G. (2006). Variable selection using svm based criteria. In International workshop on feature selection for data mining (pp. 131–1350). Mejía-Lavalle, M, Sucar, E., & Arroyo, G. (2006). Variable selection using svm based criteria. In International workshop on feature selection for data mining (pp. 131–1350).
Zurück zum Zitat Rakotomamonjy, A. (2003). Variable selection using svm based criteria. Journal of Machine Learning Research, 3(null), 1357–1370. Rakotomamonjy, A. (2003). Variable selection using svm based criteria. Journal of Machine Learning Research, 3(null), 1357–1370.
Zurück zum Zitat Ribeiro, M. T., Singh, S., & Guestrin, C. (2016a). “why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 1135–1144). Ribeiro, M. T., Singh, S., & Guestrin, C. (2016a). “why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 1135–1144).
Zurück zum Zitat Ribeiro, M. T., Singh, S., & Guestrin, C. (2016b). “why should i trust you?”: Explaining the predictions of any classifier. In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16 (pp. 1135–1144). New York: ACM. Ribeiro, M. T., Singh, S., & Guestrin, C. (2016b). “why should i trust you?”: Explaining the predictions of any classifier. In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16 (pp. 1135–1144). New York: ACM.
Zurück zum Zitat Robnik-Sikonja, M., & Bohanec, M. (2018). Perturbation-Based Explanations of Prediction Models. In Human and Machine Learning (pp. 159–175). Robnik-Sikonja, M., & Bohanec, M. (2018). Perturbation-Based Explanations of Prediction Models. In Human and Machine Learning (pp. 159–175).
Zurück zum Zitat Shapley, L. S. (1953). A value for n-person games. Contributions to the Theory of Games (28), 307–317. Shapley, L. S. (1953). A value for n-person games. Contributions to the Theory of Games (28), 307–317.
Zurück zum Zitat Shrikumar, A., Greenside, P., & Kundaje, A. (2017). Learning Important Features Through Propagating Activation Differences. In Proceedings of the 34th International Conference on Machine Learning - Volume 70, ICML’17. event-place: Sydney, NSW, Australia (pp. 3145–3153). Shrikumar, A., Greenside, P., & Kundaje, A. (2017). Learning Important Features Through Propagating Activation Differences. In Proceedings of the 34th International Conference on Machine Learning - Volume 70, ICML’17. event-place: Sydney, NSW, Australia (pp. 3145–3153).
Zurück zum Zitat Štrumbelj, E., & Kononenko, I. (2008). Towards a model independent method for explaining classification for individual instances. In International Conference on Data Warehousing and Knowledge Discovery (pp. 273–282): Springer. Štrumbelj, E., & Kononenko, I. (2008). Towards a model independent method for explaining classification for individual instances. In International Conference on Data Warehousing and Knowledge Discovery (pp. 273–282): Springer.
Zurück zum Zitat Strumbelj, E., & Kononenko, I. (2010). An Efficient Explanation of Individual Classifications Using Game Theory. Journal of Machine Learning Research, 11, 1–18. Publisher: JMLR.org. Strumbelj, E., & Kononenko, I. (2010). An Efficient Explanation of Individual Classifications Using Game Theory. Journal of Machine Learning Research, 11, 1–18. Publisher: JMLR.org.
Zurück zum Zitat Strumbelj, E., & Kononenko, I. (2013). Explaining prediction models and individual predictions with feature contributions. Knowledge and Information Systems, 41, 647–665.CrossRef Strumbelj, E., & Kononenko, I. (2013). Explaining prediction models and individual predictions with feature contributions. Knowledge and Information Systems, 41, 647–665.CrossRef
Zurück zum Zitat Vanschoren, J., van Rijn, J. N., Bischl, B., & Torgo, L. (2013). Openml: Networked science in machine learning. SIGKDD Explorations, 15(2), 49–60.CrossRef Vanschoren, J., van Rijn, J. N., Bischl, B., & Torgo, L. (2013). Openml: Networked science in machine learning. SIGKDD Explorations, 15(2), 49–60.CrossRef
Zurück zum Zitat Wachter, S., Mittelstadt, B., & Russell, C. (2017). Counterfactual explanations without opening the black box: Automated decisions and the gdpr. Harv. JL & Tech., 31, 841. Wachter, S., Mittelstadt, B., & Russell, C. (2017). Counterfactual explanations without opening the black box: Automated decisions and the gdpr. Harv. JL & Tech., 31, 841.
Zurück zum Zitat Wexler, J., Pushkarna, M., Bolukbasi, T., Wattenberg, M., Viégas, F., & Wilson, J. (2019). The what-if tool: Interactive probing of machine learning models. IEEE Transactions on Visualization and Computer Graphics, 26(1), 56–65. Wexler, J., Pushkarna, M., Bolukbasi, T., Wattenberg, M., Viégas, F., & Wilson, J. (2019). The what-if tool: Interactive probing of machine learning models. IEEE Transactions on Visualization and Computer Graphics, 26(1), 56–65.
Zurück zum Zitat Yu, L., & Liu, H. (2004). Efficient feature selection via analysis of relevance and redundancy. Journal of Machine Learning Research, 5, 1205–1224. Yu, L., & Liu, H. (2004). Efficient feature selection via analysis of relevance and redundancy. Journal of Machine Learning Research, 5, 1205–1224.
Zurück zum Zitat Zheng, Z., Peng, F., Xu, B., Zhao, J., Liu, H., Peng, J., Li, Q., Jiang, C., Zhou, Y., Liu, S., & et al. (2020). Risk factors of critical & mortal covid-19 cases: A systematic literature review and meta-analysis. Journal of Infection. Zheng, Z., Peng, F., Xu, B., Zhao, J., Liu, H., Peng, J., Li, Q., Jiang, C., Zhou, Y., Liu, S., & et al. (2020). Risk factors of critical & mortal covid-19 cases: A systematic literature review and meta-analysis. Journal of Infection.
Metadaten
Titel
Coalitional Strategies for Efficient Individual Prediction Explanation
verfasst von
Gabriel Ferrettini
Elodie Escriva
Julien Aligon
Jean-Baptiste Excoffier
Chantal Soulé-Dupuy
Publikationsdatum
22.05.2021
Verlag
Springer US
Erschienen in
Information Systems Frontiers / Ausgabe 1/2022
Print ISSN: 1387-3326
Elektronische ISSN: 1572-9419
DOI
https://doi.org/10.1007/s10796-021-10141-9

Weitere Artikel der Ausgabe 1/2022

Information Systems Frontiers 1/2022 Zur Ausgabe

Premium Partner