2018 | OriginalPaper | Book chapter

Identifying Exceptional Descriptions of People Using Topic Modeling and Subgroup Discovery

Written by: Andrew T. Hendrickson, Jason Wang, Martin Atzmueller

Published in: Foundations of Intelligent Systems

Publisher: Springer International Publishing

Abstract

Descriptions of images form the backbone of many intelligent systems, on the assumption that descriptions vary randomly in construction and content while their content remains homogeneous. This assumption becomes problematic when it is extended to descriptions of images of people [14], because people are known to show systematic biases in how they process others [19]. This paper therefore presents a novel approach for discovering exceptional subgroups of descriptions whose content reliably differs from that of the general set of descriptions. We develop a novel interestingness measure for subgroup discovery that is appropriate for probability distributions across semantic representations. The proposed method is applied to a web-based experiment in which 500 raters describe images of 200 people. Our analysis identifies multiple exceptional subgroups and the attributes of the respective raters and images. We further discuss implications for intelligent systems.

Footnotes
1
The difference between documents was calculated as the sum across all pairs of descriptions of the cosine similarity of the topic probability distributions. The number of topics per document was calculated as the sum across all descriptions of the conditional entropy of the topic probability distribution.
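As a rough illustration of the two quantities described in this footnote, the sketch below sums the cosine similarity over all pairs of topic distributions and sums an entropy over all descriptions. The function names and the toy data are hypothetical, and plain Shannon entropy of each description's topic distribution is used as a stand-in, since the footnote does not specify what the "conditional entropy" is conditioned on. It is not the authors' implementation.

```python
import itertools
import math


def cosine_similarity(p, q):
    """Cosine similarity between two topic probability vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm_p = math.sqrt(sum(a * a for a in p))
    norm_q = math.sqrt(sum(b * b for b in q))
    return dot / (norm_p * norm_q)


def summed_pairwise_cosine(distributions):
    """Sum of cosine similarity over all unordered pairs of descriptions."""
    return sum(
        cosine_similarity(p, q)
        for p, q in itertools.combinations(distributions, 2)
    )


def shannon_entropy(p):
    """Shannon entropy (natural log) of one distribution.

    Assumption: used here in place of the footnote's unspecified
    conditional entropy.
    """
    return -sum(x * math.log(x) for x in p if x > 0)


def summed_entropy(distributions):
    """Sum of per-description entropies across all descriptions."""
    return sum(shannon_entropy(p) for p in distributions)


# Hypothetical toy data: three descriptions over two topics.
toy = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
```

Under these assumptions, a lower summed similarity means the descriptions are more distinct from one another, and a lower summed entropy means each description concentrates on fewer topics.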
 
References
1. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of VLDB, pp. 487–499. Morgan Kaufmann (1994)
2. Antol, S., et al.: VQA: visual question answering. In: Proceedings of IEEE ICCV, pp. 2425–2433 (2015)
3. Atzmueller, M.: Subgroup discovery. WIREs DMKD 5(1), 35–49 (2015)
5. Atzmueller, M., Lemmerich, F.: Exploratory pattern mining on social media using geo-references and social tagging information. IJWS 2(1/2), 80–112 (2013)
7. Bayardo, R., Agrawal, R., Gunopulos, D.: Constraint-based rule mining in large, dense databases. Data Min. Knowl. Discov. 4, 217–240 (2000)
8. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. JMLR 3, 993–1022 (2003)
9. Borji, A., Cheng, M.M., Jiang, H., Li, J.: Salient object detection: a benchmark. IEEE Trans. Image Process. 24(12), 5706–5722 (2015)
10. Chrupała, G., Gelderloos, L., Alishahi, A.: Representations of language in a model of visually grounded speech signal. In: Proceedings of ACL, pp. 613–622 (2017)
11. Duivesteijn, W., Feelders, A.J., Knobbe, A.: Exceptional model mining. Data Min. Knowl. Discov. 30(1), 47–98 (2016)
12. Dwork, C., Hardt, M., Pitassi, T., Reingold, O., Zemel, R.S.: Fairness through awareness. CoRR abs/1104.3913 (2011)
13. Ganter, B., Wille, R.: Formal concept analysis. Wissenschaftliche Zeitschrift-Technischen Universitat Dresden 45, 8–13 (1996)
14. Gatt, A., et al.: Face2Text: collecting an annotated image description corpus for the generation of rich face descriptions. In: Proceedings of LREC (2018)
15. Herlitz, A., Lovén, J.: Sex differences and the own-gender bias in face recognition: a meta-analytic review. Visual Cogn. 21(9–10), 1306–1336 (2013)
16. Krishna, R., et al.: Visual genome: connecting language and vision using crowdsourced dense image annotations. Int. J. Comput. Vis. 123(1), 32–73 (2017)
19. Levin, D.T.: Race as a visual feature: using visual search and perceptual discrimination tasks to understand face categories and the cross-race recognition deficit. J. Exp. Psychol. Gen. 129(4), 559–574 (2000)
21. Minka, T.: Estimating a Dirichlet distribution. Technical report, MIT (2000)
23. Torralba, A., Efros, A.A.: Unbiased look at dataset bias. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1521–1528. IEEE (2011)
Metadata
Title
Identifying Exceptional Descriptions of People Using Topic Modeling and Subgroup Discovery
Written by
Andrew T. Hendrickson
Jason Wang
Martin Atzmueller
Copyright year
2018
DOI
https://doi.org/10.1007/978-3-030-01851-1_44
