nach oben

Erschienen in:

2017 | OriginalPaper | Buchkapitel

Fair Kernel Learning

verfasst von : Adrián Pérez-Suay, Valero Laparra, Gonzalo Mateo-García, Jordi Muñoz-Marí, Luis Gómez-Chova, Gustau Camps-Valls

Erschienen in: Machine Learning and Knowledge Discovery in Databases

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

New social and economic activities massively exploit big data and machine learning algorithms to do inference on people’s lives. Applications include automatic curricula evaluation, wage determination, and risk assessment for credits and loans. Recently, many governments and institutions have raised concerns about the lack of fairness, equity and ethics in machine learning to treat these problems. It has been shown that not including sensitive features that bias fairness, such as gender or race, is not enough to mitigate the discrimination when other related features are included. Instead, including fairness in the objective function has been shown to be more efficient.

We present novel fair regression and dimensionality reduction methods built on a previously proposed fair classification framework. Both methods rely on using the Hilbert Schmidt independence criterion as the fairness term. Unlike previous approaches, this allows us to simplify the problem and to use multiple sensitive variables simultaneously. Replacing the linear formulation by kernel functions allows the methods to deal with nonlinear problems. For both linear and nonlinear formulations the solution reduces to solving simple matrix inversions or generalized eigenvalue problems. This simplifies the evaluation of the solutions for different trade-off values between the predictive error and fairness terms. We illustrate the usefulness of the proposed methods in toy examples, and evaluate their performance on real world datasets to predict income using gender and/or race discrimination as sensitive variables, and contraceptive method prediction under demographic and socio-economic sensitive descriptors.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Entropic Trace Estimates for Log Determinants

Nächstes Kapitel GaKCo: A Fast Gapped k-mer String Kernel Using Counting

http://www.fatml.org/.

The covariance matrix is \({\mathcal C}_{{\mathbf y}\mathbf{s}} = {\mathbb E}_{{\mathbf y}\mathbf{s}}({\mathbf y}\mathbf{s}^\top ) - {\mathbb E}_{{\mathbf y}}({\mathbf y}){\mathbb E}_{\mathbf{s}}(\mathbf{s}^\top )\), where \({\mathbb E}_{{\mathbf y}\mathbf{s}}\) is the expectation with respect to \({\mathbb P}_{{\mathbf y}\mathbf{s}}\), and \({\mathbb E}_{{\mathbf y}}\) is the marginal expectation with respect to \({\mathbb P}_{{\mathbf y}}\) (hereafter we assume that all these quantities exist).

https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html.

Arenas-García, J., Petersen, K.B., Hansen, L.K.: Sparse kernel orthonormalized PLS for feature extraction in large data sets. In: NIPS, vol. 19. MIT Press (2007)

Arenas-García, J., Camps-Valls, G.: Efficient kernel orthonormalized PLS for remote sensing applications. IEEE Trans. Geos. Remote Sens. 46, 2872–2881 (2008)CrossRef

Barocas, S., Selbst, A.D.: Big Data’s Disparate Impact. SSRN eLibrary (2014)

Brennan, T., Dieterich, W., Ehret, B.: Evaluating the predictive validity of the compas risk and needs assessment system. Crim. Justice Behav. 36(1), 21–40 (2009)CrossRef

Chouldechova, A.: Fair prediction with disparate impact: a study of bias in recidivism prediction instruments. CoRR abs/1610.07524 (2016)

Clarke, K.A.: The phantom menace: omitted variable bias in econometric research. Conflict Manag. Peace Sci. 22(4), 341–352 (2005)CrossRef

Corbett-Davies, S., Pierson, E., Feller, A., Goel, S., Huq, A.: Algorithmic decision making and the cost of fairness. CoRR abs/1701.08230 (2017)

Cunningham, M.D., Sorensen, J.R.: Actuarial models for assessing prison violence risk. Assessment 13(3), 253–265 (2006)CrossRef

Dieterich, W., Mendoza, C., Brennan, T.: COMPAS risk scales: demonstrating accuracy equity and predictive parity. Working paper, Northpointe Inc., Res. Dep. (2016)

10.

Dimitrakakis, C., Liu, Y., Parkes, D., Radanovic, G.: Subjective fairness: fairness is in the eye of the beholder. Technical report arXiv: 1706.00119 (2017)

11.

Dwork, C., Hardt, M., Pitassi, T., Reingold, O., Zemel, R.: Fairness through awareness. In: ITCS 2012, pp. 214–226. ACM, New York (2012)

12.

Feldman, M., Friedler, S.A., Moeller, J., Scheidegger, C., Venkatasubramanian, S.: Certifying and removing disparate impact. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2015, pp. 259–268. ACM, New York (2015)

13.

Gómez-Chova, L., Nielsen, A.A., Camps-Valls, G.: Explicit signal to noise ratio in reproducing kernel Hilbert spaces. In: IGARSS, pp. 3570–3573, July 2011

14.

Gretton, A., Herbrich, R., Hyvärinen, A.: Kernel methods for measuring independence. J. Mach. Learn. Res. 6, 2075–2129 (2005)MathSciNetMATH

15.

Hoffman, M., Kahn, L.B., Li, D.: Discretion in hiring, Working Paper 16–055. Harvard Business School (2015)

16.

Kamiran, F., Calders, T.: Classifying without discriminating. In: 2009 2nd International Conference on Computer, Control and Communication, pp. 1–6, February 2009

17.

Kamiran, F., Calders, T.: Data preprocessing techniques for classification without discrimination. Knowl. Inf. Syst. 33(1), 1–33 (2012)CrossRef

18.

Kamishima, T., Akaho, S., Asoh, H., Sakuma, J.: The independence of fairness-aware classifiers. In: 2013 IEEE 13th International Conference on Data Mining Work, pp. 849–858 (2013)

19.

Kamishima, T., Akaho, S., Asoh, H., Sakuma, J.: Fairness-aware classifier with prejudice remover regularizer. In: Flach, P.A., De Bie, T., Cristianini, N. (eds.) ECML PKDD 2012. LNCS (LNAI), vol. 7524, pp. 35–50. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33486-3_3 CrossRef

20.

Lichman, M.: UCI machine learning repository (2013)

21.

Luo, L., Liu, W., Koprinska, I., Chen, F.: Discrimination-aware association rule mining for unbiased data analytics. In: Madria, S., Hara, T. (eds.) DaWaK 2015. LNCS, vol. 9263, pp. 108–120. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-22729-0_9 CrossRef

22.

Pedreschi, D., Ruggieri, S., Turini, F.: Discrimination-aware data mining. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2008, pp. 560–568. ACM, New York (2008)

23.

Ristanoski, G., Liu, W., Bailey, J.: Discrimination aware classification for imbalanced datasets. In: CIKM 2013, pp. 1529–1532. ACM, New York (2013)

24.

Ruggieri, S., Pedreschi, D., Turini, F.: Data mining for discrimination discovery. ACM Trans. Knowl. Discov. Data 4(2), 9:1–9:40 (2010)CrossRef

25.

Schölkopf, B., Smola, A.: Learning with Kernels - Support Vector Machines, Regularization, Optimization and Beyond. MIT Press Series, Cambridge (2002)

26.

Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. CUP, Cambridge (2004)CrossRefMATH

27.

Worsley, K.J., Poline, J.B., Friston, K.J., Evans, A.C.: Characterizing the response of pet and fMRI data using multivariate linear models. Neuroimage 6, 305–319 (1998)CrossRef

28.

Zafar, M.B., Valera, I., Rodriguez, M.G., Gummadi, K.P.: Learning fair classifiers, May 2016. http://arxiv.org/abs/1507.05259

29.

Zafar, M.B., Valera, I., Gomez Rodriguez, M., Gummadi, K.P.: Fairness constraints: mechanisms for fair classification. In: Singh, A., Zhu, J. (eds.) Proceedings of the 20th International Conference on Artificial Intelligence and Statistics Proceedings of Machine Learning Research, vol. 54, pp. 962–970. PMLR, Fort Lauderdale, FL, USA, 20–22 April 2017

30.

Zemel, R.S., Wu, Y., Swersky, K., Pitassi, T., Dwork, C.: Learning fair representations. In: ICML (3), vol. 28, pp. 325–333 (2013)

31.

Zeng, J., Ustun, B., Rudin, C.: Interpretable classification models for recidivism prediction. J. R. Stat. Soc. Ser. A (Stat. Soc.) 180, 689–722 (2016)MathSciNetCrossRef

Titel: Fair Kernel Learning
verfasst von: Adrián Pérez-Suay
Valero Laparra
Gonzalo Mateo-García
Jordi Muñoz-Marí
Luis Gómez-Chova
Gustau Camps-Valls
Verlag: Springer International Publishing
Buch: Machine Learning and Knowledge Discovery in Databases
Print ISBN: 978-3-319-71248-2

Electronic ISBN: 978-3-319-71249-9

Copyright-Jahr: 2017
DOI: https://doi.org/10.1007/978-3-319-71249-9_21

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"