Skip to main content

2019 | OriginalPaper | Buchkapitel

Fairness-Enhancing Interventions in Stream Classification

verfasst von : Vasileios Iosifidis, Thi Ngoc Han Tran, Eirini Ntoutsi

Erschienen in: Database and Expert Systems Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The wide spread usage of automated data-driven decision support systems has raised a lot of concerns regarding accountability and fairness of the employed models in the absence of human supervision. Existing fairness-aware approaches tackle fairness as a batch learning problem and aim at learning a fair model which can then be applied to future instances of the problem. In many applications, however, the data comes sequentially and its characteristics might evolve with time. In such a setting, it is counter-intuitive to “fix” a (fair) model over the data stream as changes in the data might incur changes in the underlying model therefore, affecting its fairness. In this work, we propose fairness-enhancing interventions that modify the input data so that the outcome of any stream classifier applied to that data will be fair. Experiments on real and synthetic data show that our approach achieves good predictive performance and low discrimination scores over the course of the stream.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Code will be made available online.
 
Literatur
1.
Zurück zum Zitat Bhandari, E.: Big data can be used to violate civil rights laws, and the FTC agrees (2016) Bhandari, E.: Big data can be used to violate civil rights laws, and the FTC agrees (2016)
3.
Zurück zum Zitat Bifet, A., Holmes, G., Kirkby, R., Pfahringer, B.: MOA: massive online analysis. JMLR 11, 1601–1604 (2010) Bifet, A., Holmes, G., Kirkby, R., Pfahringer, B.: MOA: massive online analysis. JMLR 11, 1601–1604 (2010)
5.
Zurück zum Zitat Calders, T., Kamiran, F., Pechenizkiy, M.: Building classifiers with independency constraints. In: ICDMW 2009, pp. 13–18. IEEE (2009) Calders, T., Kamiran, F., Pechenizkiy, M.: Building classifiers with independency constraints. In: ICDMW 2009, pp. 13–18. IEEE (2009)
6.
Zurück zum Zitat Calmon, F., Wei, D., Vinzamuri, B., Ramamurthy, K.N., Varshney, K.R.: Optimized pre-processing for discrimination prevention. In: NIPS, pp. 3992–4001 (2017) Calmon, F., Wei, D., Vinzamuri, B., Ramamurthy, K.N., Varshney, K.R.: Optimized pre-processing for discrimination prevention. In: NIPS, pp. 3992–4001 (2017)
7.
Zurück zum Zitat Domingos, P., Hulten, G.: Mining high-speed data streams. In: SIGKDD, pp. 71–80. ACM (2000) Domingos, P., Hulten, G.: Mining high-speed data streams. In: SIGKDD, pp. 71–80. ACM (2000)
8.
Zurück zum Zitat Dwork, C., Immorlica, N., Kalai, A.T., Leiserson, M.D.: Decoupled classifiers for group-fair and efficient machine learning. In: FAT, pp. 119–133 (2018) Dwork, C., Immorlica, N., Kalai, A.T., Leiserson, M.D.: Decoupled classifiers for group-fair and efficient machine learning. In: FAT, pp. 119–133 (2018)
9.
Zurück zum Zitat Edelman, B.G., Luca, M.: Digital discrimination: the case of airbnb.com (2014) Edelman, B.G., Luca, M.: Digital discrimination: the case of airbnb.com (2014)
10.
Zurück zum Zitat Fish, B., Kun, J., Lelkes, Á.D.: A confidence-based approach for balancing fairness and accuracy. In: SIAM, pp. 144–152 (2016) Fish, B., Kun, J., Lelkes, Á.D.: A confidence-based approach for balancing fairness and accuracy. In: SIAM, pp. 144–152 (2016)
11.
Zurück zum Zitat Forman, G.: Tackling concept drift by temporal inductive transfer. In: SIGIR, pp. 252–259. ACM (2006) Forman, G.: Tackling concept drift by temporal inductive transfer. In: SIGIR, pp. 252–259. ACM (2006)
12.
Zurück zum Zitat Gama, J.: Knowledge Discovery from Data Streams. CRC Press, Boca Raton (2010)CrossRef Gama, J.: Knowledge Discovery from Data Streams. CRC Press, Boca Raton (2010)CrossRef
13.
Zurück zum Zitat Ingold, D., Soper, S.: Amazon doesn’t consider the race of its customers. Should it. Bloomberg, April 2016 Ingold, D., Soper, S.: Amazon doesn’t consider the race of its customers. Should it. Bloomberg, April 2016
14.
Zurück zum Zitat Iosifidis, V., Ntoutsi, E.: Dealing with bias via data augmentation in supervised learning scenarios. Jo Bates Paul D. Clough Robert Jäschke, p. 24 (2018) Iosifidis, V., Ntoutsi, E.: Dealing with bias via data augmentation in supervised learning scenarios. Jo Bates Paul D. Clough Robert Jäschke, p. 24 (2018)
15.
Zurück zum Zitat Kamiran, F., Calders, T.: Classifying without discriminating. In: IC4. IEEE (2009) Kamiran, F., Calders, T.: Classifying without discriminating. In: IC4. IEEE (2009)
16.
Zurück zum Zitat Kamiran, F., Calders, T.: Classification with no discrimination by preferential sampling. In: BeneLearn, pp. 1–6. Citeseer (2010) Kamiran, F., Calders, T.: Classification with no discrimination by preferential sampling. In: BeneLearn, pp. 1–6. Citeseer (2010)
17.
Zurück zum Zitat Kamiran, F., Calders, T.: Data preprocessing techniques for classification without discrimination. KAIS 33(1), 1–33 (2012) Kamiran, F., Calders, T.: Data preprocessing techniques for classification without discrimination. KAIS 33(1), 1–33 (2012)
18.
Zurück zum Zitat Kamiran, F., Calders, T., Pechenizkiy, M.: Discrimination aware decision tree learning. In: ICDM, pp. 869–874. IEEE (2010) Kamiran, F., Calders, T., Pechenizkiy, M.: Discrimination aware decision tree learning. In: ICDM, pp. 869–874. IEEE (2010)
19.
Zurück zum Zitat Klinkenberg, R.: Learning drifting concepts: example selection vs. example weighting. IDA 8, 281–300 (2004)CrossRef Klinkenberg, R.: Learning drifting concepts: example selection vs. example weighting. IDA 8, 281–300 (2004)CrossRef
20.
Zurück zum Zitat Liu, L.T., Dean, S., Rolf, E., Simchowitz, M., Hardt, M.: Delayed impact of fair machine learning. arXiv preprint arXiv:1803.04383 (2018) Liu, L.T., Dean, S., Rolf, E., Simchowitz, M., Hardt, M.: Delayed impact of fair machine learning. arXiv preprint arXiv:​1803.​04383 (2018)
21.
Zurück zum Zitat Merz, C.J., Murphy, P.M.: \(\{\)UCI\(\}\) repository of machine learning databases (1998) Merz, C.J., Murphy, P.M.: \(\{\)UCI\(\}\) repository of machine learning databases (1998)
22.
Zurück zum Zitat Pedreschi, D., Ruggieri, S., Turini, F.: Measuring discrimination in socially-sensitive decision records. In: SDM, pp. 581–592. SIAM (2009) Pedreschi, D., Ruggieri, S., Turini, F.: Measuring discrimination in socially-sensitive decision records. In: SDM, pp. 581–592. SIAM (2009)
23.
Zurück zum Zitat Pedreshi, D., Ruggieri, S., Turini, F.: Discrimination-aware data mining. In: SIGKDD, pp. 560–568. ACM (2008) Pedreshi, D., Ruggieri, S., Turini, F.: Discrimination-aware data mining. In: SIGKDD, pp. 560–568. ACM (2008)
24.
Zurück zum Zitat USA: Executive Office of the President, Podesta, J.: Big data: seizing opportunities, preserving values. White House, Executive Office of the President (2014) USA: Executive Office of the President, Podesta, J.: Big data: seizing opportunities, preserving values. White House, Executive Office of the President (2014)
25.
Zurück zum Zitat Romei, A., Ruggieri, S.: A multidisciplinary survey on discrimination analysis. TKER 29(5), 582–638 (2014) Romei, A., Ruggieri, S.: A multidisciplinary survey on discrimination analysis. TKER 29(5), 582–638 (2014)
26.
Zurück zum Zitat Schlimmer, J.C., Granger, R.H.: Beyond incremental processing: tracking concept drift. In: AAAI, pp. 502–507 (1986) Schlimmer, J.C., Granger, R.H.: Beyond incremental processing: tracking concept drift. In: AAAI, pp. 502–507 (1986)
27.
Zurück zum Zitat Stoyanovich, J., Yang, K., Jagadish, H.: Online set selection with fairness and diversity constraints. In: EDBT (2018) Stoyanovich, J., Yang, K., Jagadish, H.: Online set selection with fairness and diversity constraints. In: EDBT (2018)
28.
Zurück zum Zitat Sweeney, L.: Discrimination in online ad delivery. Queue 11(3), 10 (2013)CrossRef Sweeney, L.: Discrimination in online ad delivery. Queue 11(3), 10 (2013)CrossRef
29.
Zurück zum Zitat Verma, S., Rubin, J.: Fairness definitions explained (2018) Verma, S., Rubin, J.: Fairness definitions explained (2018)
30.
Zurück zum Zitat Zafar, M.B., Valera, I., Gomez Rodriguez, M., Gummadi, K.P.: Fairness constraints: mechanisms for fair classification. arXiv preprint arXiv:1507.05259 (2017) Zafar, M.B., Valera, I., Gomez Rodriguez, M., Gummadi, K.P.: Fairness constraints: mechanisms for fair classification. arXiv preprint arXiv:​1507.​05259 (2017)
Metadaten
Titel
Fairness-Enhancing Interventions in Stream Classification
verfasst von
Vasileios Iosifidis
Thi Ngoc Han Tran
Eirini Ntoutsi
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-27615-7_20

Premium Partner