nach oben

Information Systems Frontiers

Erschienen in:

22.05.2021

Coalitional Strategies for Efficient Individual Prediction Explanation

verfasst von: Gabriel Ferrettini, Elodie Escriva, Julien Aligon, Jean-Baptiste Excoffier, Chantal Soulé-Dupuy

Erschienen in: Information Systems Frontiers | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

As Machine Learning (ML) is now widely applied in many domains, in both research and industry, an understanding of what is happening inside the black box is becoming a growing demand, especially by non-experts of these models. Several approaches had thus been developed to provide clear insights of a model prediction for a particular observation but at the cost of long computation time or restrictive hypothesis that does not fully take into account interaction between attributes. This paper provides methods based on the detection of relevant groups of attributes -named coalitions- influencing a prediction and compares them with the literature. Our results show that these coalitional methods are more efficient than existing ones such as SHapley Additive exPlanation (SHAP). Computation time is shortened while preserving an acceptable accuracy of individual prediction explanations. Therefore, this enables wider practical use of explanation methods to increase trust between developed ML models, end-users, and whoever impacted by any decision where these models played a role.

Vorheriger Artikel Enhancing Cubes with Models to Describe Multidimensional Data

Nächster Artikel Indian Travellers’ Adoption of Airbnb Platform

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Clinical app that predicts an aggravation risk for a patient hospitalized with Covid-19. Attribute influences are computed with SHAP. https://scorecovid.kaduceo.com/

Available in https://www.openml.org/s/107/tasks

https://scikit-learn.org/stable/

https://www.chicreteil.fr/

Adadi, A., & Berrada, M. (2018). Peeking inside the black-box: A survey on explainable artificial intelligence (xai). IEEE Access, 6, 52138–52160.CrossRef

Altmann, A., Toloşi, L., Sander, O., & Lengauer, T. (2010). Permutation importance: a corrected feature importance measure. Bioinformatics, 26(10), 1340–1347.CrossRef

Bach, S., Binder, A., Montavon, G., Klauschen, F., Müller, K.-R., & Samek, W. (2015). On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLOS ONE, 10(7), 1–46. https://doi.org/10.1371/journal.pone.0130140.

Bibault, J.-E., Chang, D., & Xing, L. (2020). Development and validation of a model to predict survival in colorectal cancer using a gradient-boosted machine. Gut. https://doi.org/10.1136/gutjnl-2020-321799.

Bolón-Canedo, V., Sánchez-Maroño, N., & Alonso-Betanzos, A. (2013). A review of feature selection methods on synthetic data. Knowledge and Information Systems, 34 (3), 483–519. https://doi.org/10.1007/s10115-012-0487-8 (English).CrossRef

Carvalho, D. V., Pereira, E. M., & Cardoso, J. S. (2019). Machine learning interpretability: A survey on methods and metrics. Electronics, 8(8), 832.CrossRef

Casalicchio, G., Molnar, C., & Bischl, B. (2018). Visualizing the feature importance for black box models. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases (pp. 655–670): Springer.

Datta, A, Sen, S, & Zick, Y. (2016). Algorithmic transparency via quantitative input influence: Theory and experiments with learning systems. In 2016 IEEE Symposium on Security and Privacy (SP) (pp. 598–617).

den Broeck, G. V., Lykov, A., Schleich, M., & Suciu, D. (2020). On the tractability of shap explanations.

Eitzinger, S., Asif, A., Watters, K. E., Iavarone, A. T., Knott, G. J., Doudna, J. A., & Minhas, F. A. A. (2020). Machine learning predicts new anti-CRISPR proteins. Nucleic Acids Research, 48(9), 4698–4708. https://doi.org/10.1093/nar/gkaa219.CrossRef

ElShawi, R., Sherif, Y., Al-Mallah, M., & Sakr, S. (2020). Interpretability in healthcare: A comparative study of local machine learning interpretability techniques. Computational Intelligence.

Ferrettini, G., Aligon, J., & Soulé-Dupuy, C. (2020a). Explaining single predictions: A faster method. In Chatzigeorgiou, A., Dondi, R., Herodotou, H., Kapoutsis, C., Manolopoulos, Y., Papadopoulos, G.A., & Sikora, F. (Eds.) SOFSEM 2020: Theory and Practice of Computer Science (pp. 313–324). Cham: Springer International Publishing.

Ferrettini, G., Aligon, J., & Soulé-Dupuy, C. (2020b). Improving on coalitional prediction explanation. In Darmont, J., Novikov, B., & Wrembel, R. (Eds.) Advances in Databases and Information Systems - 24th European Conference, ADBIS 2020, Proceedings, Lecture Notes in Computer Science. https://doi.org/10.1007/978-3-030-54832-2_11, (Vol. 12245 pp. 122–135). Lyon: Springer.

Francone, M., Iafrate, F., Masci, G. M., Coco, S., Cilia, F., Manganaro, L., Panebianco, V., Andreoli, C., Colaiacomo, M. C., Zingaropoli, M. A., & et al. (2020). Chest ct score in covid-19 patients: correlation with disease severity and short-term prognosis. European Radiology, 30(12), 6808–6817.CrossRef

Hall, M. A. (1999). Correlation-based feature selection for machine learning. Ph.D. Thesis.

Henelius, A., Puolamäki, K., & Ukkonen, A. (2017). Interpreting classifiers through attribute interactions in datasets. arXiv:1707.07576.

Henelius, A., Puolamaki, K., Boström, H., Asker, L., & Papapetrou, P. (2014). A peek into the black box : exploring classifiers by randomization. Data Mining and Knowledge Discovery, 28(5-6), 1503–1529. QC 20180119.CrossRef

Kira, K., & Rendell, L. A. (1992). A practical approach to feature selection. In Machine Learning Proceedings 1992 (pp. 249–256): Elsevier.

Lauritsen, S. M., Kristensen, M., Olsen, M. V., Larsen, M. S., Lauritsen, K. M., Jørgensen, M.J., Lange, J., & Thiesson, B. (2020). Explainable artificial intelligence model to predict acute critical illness from electronic health records. Nature Communications, 11(1), 1– 11.CrossRef

Lipovetsky, S., & Conklin, M. (2001). Analysis of regression in game theory approach. Applied Stochastic Models in Business and Industry, 17, 319–330. https://doi.org/10.1002/asmb.446.CrossRef

Lundberg, S. M., & Lee, S-I. (2017a). Consistent feature attribution for tree ensembles. arXiv:1706.06060.

Lundberg, S. M., & Lee, S.-I. (2017b). A Unified Approach to Interpreting Model Predictions. In Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., & Garnett, R. (Eds.) Advances in Neural Information Processing Systems 30. http://papers.nips.cc/paper/7062-a-unified-approach-to-interpreting-model-predictions.pdf (pp. 4765–4774): Curran Associates, Inc.

Makki, S. (2019). An efficient classification model for analyzing skewed data to detect frauds in the financial sector. Ph.D. Thesis, Université de Lyon; Université libanaise.

Mejía-Lavalle, M, Sucar, E., & Arroyo, G. (2006). Variable selection using svm based criteria. In International workshop on feature selection for data mining (pp. 131–1350).

Rakotomamonjy, A. (2003). Variable selection using svm based criteria. Journal of Machine Learning Research, 3(null), 1357–1370.

Ribeiro, M. T., Singh, S., & Guestrin, C. (2016a). “why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 1135–1144).

Ribeiro, M. T., Singh, S., & Guestrin, C. (2016b). “why should i trust you?”: Explaining the predictions of any classifier. In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16 (pp. 1135–1144). New York: ACM.

Robnik-Sikonja, M., & Bohanec, M. (2018). Perturbation-Based Explanations of Prediction Models. In Human and Machine Learning (pp. 159–175).

Shapley, L. S. (1953). A value for n-person games. Contributions to the Theory of Games (28), 307–317.

Shrikumar, A., Greenside, P., & Kundaje, A. (2017). Learning Important Features Through Propagating Activation Differences. In Proceedings of the 34th International Conference on Machine Learning - Volume 70, ICML’17. event-place: Sydney, NSW, Australia (pp. 3145–3153).

Štrumbelj, E., & Kononenko, I. (2008). Towards a model independent method for explaining classification for individual instances. In International Conference on Data Warehousing and Knowledge Discovery (pp. 273–282): Springer.

Strumbelj, E., & Kononenko, I. (2010). An Efficient Explanation of Individual Classifications Using Game Theory. Journal of Machine Learning Research, 11, 1–18. Publisher: JMLR.org.

Strumbelj, E., & Kononenko, I. (2013). Explaining prediction models and individual predictions with feature contributions. Knowledge and Information Systems, 41, 647–665.CrossRef

Tjoa, E., & Guan, C. (2020). A survey on explainable artificial intelligence (xai): Toward medical xai. IEEE Transactions on Neural Networks and Learning Systems, 1–21. https://doi.org/10.1109/TNNLS.2020.3027314.

Vanschoren, J., van Rijn, J. N., Bischl, B., & Torgo, L. (2013). Openml: Networked science in machine learning. SIGKDD Explorations, 15(2), 49–60.CrossRef

Wachter, S., Mittelstadt, B., & Russell, C. (2017). Counterfactual explanations without opening the black box: Automated decisions and the gdpr. Harv. JL & Tech., 31, 841.

Wexler, J., Pushkarna, M., Bolukbasi, T., Wattenberg, M., Viégas, F., & Wilson, J. (2019). The what-if tool: Interactive probing of machine learning models. IEEE Transactions on Visualization and Computer Graphics, 26(1), 56–65.

Yu, L., & Liu, H. (2004). Efficient feature selection via analysis of relevance and redundancy. Journal of Machine Learning Research, 5, 1205–1224.

Zheng, Z., Peng, F., Xu, B., Zhao, J., Liu, H., Peng, J., Li, Q., Jiang, C., Zhou, Y., Liu, S., & et al. (2020). Risk factors of critical & mortal covid-19 cases: A systematic literature review and meta-analysis. Journal of Infection.

Titel: Coalitional Strategies for Efficient Individual Prediction Explanation
verfasst von: Gabriel Ferrettini
Elodie Escriva
Julien Aligon
Jean-Baptiste Excoffier
Chantal Soulé-Dupuy
Publikationsdatum: 22.05.2021
Verlag: Springer US
Erschienen in: Information Systems Frontiers / Ausgabe 1/2022
Print ISSN: 1387-3326
Elektronische ISSN: 1572-9419
DOI: https://doi.org/10.1007/s10796-021-10141-9

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 1/2022

Context Modeling for the Adaption of Mobile Business Processes – An Empirical Usability Evaluation

A Deeper Look at Cloud Adoption Trajectory and Dilemma

Speeding Up Reachability Queries in Public Transport Networks Using Graph Partitioning

Usage Continuance in Software-as-a-Service

Something’s Missing? A Procedure for Extending Item Content Data Sets in the Context of Recommender Systems

Discovering Primary Medical Procedures and their Associations with Other Procedures in HCUP Data

Premium Partner