Skip to main content

2017 | OriginalPaper | Buchkapitel

Fair Kernel Learning

verfasst von : Adrián Pérez-Suay, Valero Laparra, Gonzalo Mateo-García, Jordi Muñoz-Marí, Luis Gómez-Chova, Gustau Camps-Valls

Erschienen in: Machine Learning and Knowledge Discovery in Databases

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

New social and economic activities massively exploit big data and machine learning algorithms to do inference on people’s lives. Applications include automatic curricula evaluation, wage determination, and risk assessment for credits and loans. Recently, many governments and institutions have raised concerns about the lack of fairness, equity and ethics in machine learning to treat these problems. It has been shown that not including sensitive features that bias fairness, such as gender or race, is not enough to mitigate the discrimination when other related features are included. Instead, including fairness in the objective function has been shown to be more efficient.
We present novel fair regression and dimensionality reduction methods built on a previously proposed fair classification framework. Both methods rely on using the Hilbert Schmidt independence criterion as the fairness term. Unlike previous approaches, this allows us to simplify the problem and to use multiple sensitive variables simultaneously. Replacing the linear formulation by kernel functions allows the methods to deal with nonlinear problems. For both linear and nonlinear formulations the solution reduces to solving simple matrix inversions or generalized eigenvalue problems. This simplifies the evaluation of the solutions for different trade-off values between the predictive error and fairness terms. We illustrate the usefulness of the proposed methods in toy examples, and evaluate their performance on real world datasets to predict income using gender and/or race discrimination as sensitive variables, and contraceptive method prediction under demographic and socio-economic sensitive descriptors.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
The covariance matrix is \({\mathcal C}_{{\mathbf y}\mathbf{s}} = {\mathbb E}_{{\mathbf y}\mathbf{s}}({\mathbf y}\mathbf{s}^\top ) - {\mathbb E}_{{\mathbf y}}({\mathbf y}){\mathbb E}_{\mathbf{s}}(\mathbf{s}^\top )\), where \({\mathbb E}_{{\mathbf y}\mathbf{s}}\) is the expectation with respect to \({\mathbb P}_{{\mathbf y}\mathbf{s}}\), and \({\mathbb E}_{{\mathbf y}}\) is the marginal expectation with respect to \({\mathbb P}_{{\mathbf y}}\) (hereafter we assume that all these quantities exist).
 
Literatur
1.
Zurück zum Zitat Arenas-García, J., Petersen, K.B., Hansen, L.K.: Sparse kernel orthonormalized PLS for feature extraction in large data sets. In: NIPS, vol. 19. MIT Press (2007) Arenas-García, J., Petersen, K.B., Hansen, L.K.: Sparse kernel orthonormalized PLS for feature extraction in large data sets. In: NIPS, vol. 19. MIT Press (2007)
2.
Zurück zum Zitat Arenas-García, J., Camps-Valls, G.: Efficient kernel orthonormalized PLS for remote sensing applications. IEEE Trans. Geos. Remote Sens. 46, 2872–2881 (2008)CrossRef Arenas-García, J., Camps-Valls, G.: Efficient kernel orthonormalized PLS for remote sensing applications. IEEE Trans. Geos. Remote Sens. 46, 2872–2881 (2008)CrossRef
3.
Zurück zum Zitat Barocas, S., Selbst, A.D.: Big Data’s Disparate Impact. SSRN eLibrary (2014) Barocas, S., Selbst, A.D.: Big Data’s Disparate Impact. SSRN eLibrary (2014)
4.
Zurück zum Zitat Brennan, T., Dieterich, W., Ehret, B.: Evaluating the predictive validity of the compas risk and needs assessment system. Crim. Justice Behav. 36(1), 21–40 (2009)CrossRef Brennan, T., Dieterich, W., Ehret, B.: Evaluating the predictive validity of the compas risk and needs assessment system. Crim. Justice Behav. 36(1), 21–40 (2009)CrossRef
5.
Zurück zum Zitat Chouldechova, A.: Fair prediction with disparate impact: a study of bias in recidivism prediction instruments. CoRR abs/1610.07524 (2016) Chouldechova, A.: Fair prediction with disparate impact: a study of bias in recidivism prediction instruments. CoRR abs/1610.07524 (2016)
6.
Zurück zum Zitat Clarke, K.A.: The phantom menace: omitted variable bias in econometric research. Conflict Manag. Peace Sci. 22(4), 341–352 (2005)CrossRef Clarke, K.A.: The phantom menace: omitted variable bias in econometric research. Conflict Manag. Peace Sci. 22(4), 341–352 (2005)CrossRef
7.
Zurück zum Zitat Corbett-Davies, S., Pierson, E., Feller, A., Goel, S., Huq, A.: Algorithmic decision making and the cost of fairness. CoRR abs/1701.08230 (2017) Corbett-Davies, S., Pierson, E., Feller, A., Goel, S., Huq, A.: Algorithmic decision making and the cost of fairness. CoRR abs/1701.08230 (2017)
8.
Zurück zum Zitat Cunningham, M.D., Sorensen, J.R.: Actuarial models for assessing prison violence risk. Assessment 13(3), 253–265 (2006)CrossRef Cunningham, M.D., Sorensen, J.R.: Actuarial models for assessing prison violence risk. Assessment 13(3), 253–265 (2006)CrossRef
9.
Zurück zum Zitat Dieterich, W., Mendoza, C., Brennan, T.: COMPAS risk scales: demonstrating accuracy equity and predictive parity. Working paper, Northpointe Inc., Res. Dep. (2016) Dieterich, W., Mendoza, C., Brennan, T.: COMPAS risk scales: demonstrating accuracy equity and predictive parity. Working paper, Northpointe Inc., Res. Dep. (2016)
10.
Zurück zum Zitat Dimitrakakis, C., Liu, Y., Parkes, D., Radanovic, G.: Subjective fairness: fairness is in the eye of the beholder. Technical report arXiv: 1706.00119 (2017) Dimitrakakis, C., Liu, Y., Parkes, D., Radanovic, G.: Subjective fairness: fairness is in the eye of the beholder. Technical report arXiv:​ 1706.​00119 (2017)
11.
Zurück zum Zitat Dwork, C., Hardt, M., Pitassi, T., Reingold, O., Zemel, R.: Fairness through awareness. In: ITCS 2012, pp. 214–226. ACM, New York (2012) Dwork, C., Hardt, M., Pitassi, T., Reingold, O., Zemel, R.: Fairness through awareness. In: ITCS 2012, pp. 214–226. ACM, New York (2012)
12.
Zurück zum Zitat Feldman, M., Friedler, S.A., Moeller, J., Scheidegger, C., Venkatasubramanian, S.: Certifying and removing disparate impact. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2015, pp. 259–268. ACM, New York (2015) Feldman, M., Friedler, S.A., Moeller, J., Scheidegger, C., Venkatasubramanian, S.: Certifying and removing disparate impact. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2015, pp. 259–268. ACM, New York (2015)
13.
Zurück zum Zitat Gómez-Chova, L., Nielsen, A.A., Camps-Valls, G.: Explicit signal to noise ratio in reproducing kernel Hilbert spaces. In: IGARSS, pp. 3570–3573, July 2011 Gómez-Chova, L., Nielsen, A.A., Camps-Valls, G.: Explicit signal to noise ratio in reproducing kernel Hilbert spaces. In: IGARSS, pp. 3570–3573, July 2011
14.
Zurück zum Zitat Gretton, A., Herbrich, R., Hyvärinen, A.: Kernel methods for measuring independence. J. Mach. Learn. Res. 6, 2075–2129 (2005)MathSciNetMATH Gretton, A., Herbrich, R., Hyvärinen, A.: Kernel methods for measuring independence. J. Mach. Learn. Res. 6, 2075–2129 (2005)MathSciNetMATH
15.
Zurück zum Zitat Hoffman, M., Kahn, L.B., Li, D.: Discretion in hiring, Working Paper 16–055. Harvard Business School (2015) Hoffman, M., Kahn, L.B., Li, D.: Discretion in hiring, Working Paper 16–055. Harvard Business School (2015)
16.
Zurück zum Zitat Kamiran, F., Calders, T.: Classifying without discriminating. In: 2009 2nd International Conference on Computer, Control and Communication, pp. 1–6, February 2009 Kamiran, F., Calders, T.: Classifying without discriminating. In: 2009 2nd International Conference on Computer, Control and Communication, pp. 1–6, February 2009
17.
Zurück zum Zitat Kamiran, F., Calders, T.: Data preprocessing techniques for classification without discrimination. Knowl. Inf. Syst. 33(1), 1–33 (2012)CrossRef Kamiran, F., Calders, T.: Data preprocessing techniques for classification without discrimination. Knowl. Inf. Syst. 33(1), 1–33 (2012)CrossRef
18.
Zurück zum Zitat Kamishima, T., Akaho, S., Asoh, H., Sakuma, J.: The independence of fairness-aware classifiers. In: 2013 IEEE 13th International Conference on Data Mining Work, pp. 849–858 (2013) Kamishima, T., Akaho, S., Asoh, H., Sakuma, J.: The independence of fairness-aware classifiers. In: 2013 IEEE 13th International Conference on Data Mining Work, pp. 849–858 (2013)
20.
Zurück zum Zitat Lichman, M.: UCI machine learning repository (2013) Lichman, M.: UCI machine learning repository (2013)
22.
Zurück zum Zitat Pedreschi, D., Ruggieri, S., Turini, F.: Discrimination-aware data mining. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2008, pp. 560–568. ACM, New York (2008) Pedreschi, D., Ruggieri, S., Turini, F.: Discrimination-aware data mining. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2008, pp. 560–568. ACM, New York (2008)
23.
Zurück zum Zitat Ristanoski, G., Liu, W., Bailey, J.: Discrimination aware classification for imbalanced datasets. In: CIKM 2013, pp. 1529–1532. ACM, New York (2013) Ristanoski, G., Liu, W., Bailey, J.: Discrimination aware classification for imbalanced datasets. In: CIKM 2013, pp. 1529–1532. ACM, New York (2013)
24.
Zurück zum Zitat Ruggieri, S., Pedreschi, D., Turini, F.: Data mining for discrimination discovery. ACM Trans. Knowl. Discov. Data 4(2), 9:1–9:40 (2010)CrossRef Ruggieri, S., Pedreschi, D., Turini, F.: Data mining for discrimination discovery. ACM Trans. Knowl. Discov. Data 4(2), 9:1–9:40 (2010)CrossRef
25.
Zurück zum Zitat Schölkopf, B., Smola, A.: Learning with Kernels - Support Vector Machines, Regularization, Optimization and Beyond. MIT Press Series, Cambridge (2002) Schölkopf, B., Smola, A.: Learning with Kernels - Support Vector Machines, Regularization, Optimization and Beyond. MIT Press Series, Cambridge (2002)
26.
Zurück zum Zitat Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. CUP, Cambridge (2004)CrossRefMATH Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. CUP, Cambridge (2004)CrossRefMATH
27.
Zurück zum Zitat Worsley, K.J., Poline, J.B., Friston, K.J., Evans, A.C.: Characterizing the response of pet and fMRI data using multivariate linear models. Neuroimage 6, 305–319 (1998)CrossRef Worsley, K.J., Poline, J.B., Friston, K.J., Evans, A.C.: Characterizing the response of pet and fMRI data using multivariate linear models. Neuroimage 6, 305–319 (1998)CrossRef
29.
Zurück zum Zitat Zafar, M.B., Valera, I., Gomez Rodriguez, M., Gummadi, K.P.: Fairness constraints: mechanisms for fair classification. In: Singh, A., Zhu, J. (eds.) Proceedings of the 20th International Conference on Artificial Intelligence and Statistics Proceedings of Machine Learning Research, vol. 54, pp. 962–970. PMLR, Fort Lauderdale, FL, USA, 20–22 April 2017 Zafar, M.B., Valera, I., Gomez Rodriguez, M., Gummadi, K.P.: Fairness constraints: mechanisms for fair classification. In: Singh, A., Zhu, J. (eds.) Proceedings of the 20th International Conference on Artificial Intelligence and Statistics Proceedings of Machine Learning Research, vol. 54, pp. 962–970. PMLR, Fort Lauderdale, FL, USA, 20–22 April 2017
30.
Zurück zum Zitat Zemel, R.S., Wu, Y., Swersky, K., Pitassi, T., Dwork, C.: Learning fair representations. In: ICML (3), vol. 28, pp. 325–333 (2013) Zemel, R.S., Wu, Y., Swersky, K., Pitassi, T., Dwork, C.: Learning fair representations. In: ICML (3), vol. 28, pp. 325–333 (2013)
31.
Zurück zum Zitat Zeng, J., Ustun, B., Rudin, C.: Interpretable classification models for recidivism prediction. J. R. Stat. Soc. Ser. A (Stat. Soc.) 180, 689–722 (2016)MathSciNetCrossRef Zeng, J., Ustun, B., Rudin, C.: Interpretable classification models for recidivism prediction. J. R. Stat. Soc. Ser. A (Stat. Soc.) 180, 689–722 (2016)MathSciNetCrossRef
Metadaten
Titel
Fair Kernel Learning
verfasst von
Adrián Pérez-Suay
Valero Laparra
Gonzalo Mateo-García
Jordi Muñoz-Marí
Luis Gómez-Chova
Gustau Camps-Valls
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-71249-9_21