Skip to main content
Top

2018 | OriginalPaper | Chapter

Identifying Exceptional Descriptions of People Using Topic Modeling and Subgroup Discovery

Authors : Andrew T. Hendrickson, Jason Wang, Martin Atzmueller

Published in: Foundations of Intelligent Systems

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Descriptions of images form the backbone for many intelligent systems, assuming descriptions that randomly vary in construction and content, but where description content is homogeneous. This assumption becomes problematic being extended to descriptions of images of people [14], where people are known to show systematic biases in how they process others [19]. Therefore, this paper presents a novel approach for discovering exceptional subgroups of descriptions in which the content of those descriptions reliably differs from the general set of descriptions. We develop a novel interestingness measure for subgroup discovery appropriate for probability distributions across semantic representations. The proposed method is applied to a web-based experiment in which 500 raters describe images of 200 people. Our analysis identifies multiple exceptional subgroups and the attributes of the respective raters and images. We further discuss implications for intelligent systems.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
The difference between documents was calculated as the sum across all pairs of descriptions of the cosine similarity of the topic probability distributions. The number of topics per document was calculated as the sum across all descriptions of the conditional entropy of the topic probability distribution.
 
Literature
1.
go back to reference Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of VLDB, pp. 487–499. Morgan Kaufmann (1994) Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of VLDB, pp. 487–499. Morgan Kaufmann (1994)
2.
go back to reference Antol, S., et al.: VQA: visual question answering. In: Proceedings of IEEE ICCV, pp. 2425–2433 (2015) Antol, S., et al.: VQA: visual question answering. In: Proceedings of IEEE ICCV, pp. 2425–2433 (2015)
3.
go back to reference Atzmueller, M.: Subgroup discovery. WIREs DMKD 5(1), 35–49 (2015) Atzmueller, M.: Subgroup discovery. WIREs DMKD 5(1), 35–49 (2015)
5.
go back to reference Atzmueller, M., Lemmerich, F.: Exploratory pattern mining on social media using geo-references and social tagging information. IJWS 2(1/2), 80–112 (2013)CrossRef Atzmueller, M., Lemmerich, F.: Exploratory pattern mining on social media using geo-references and social tagging information. IJWS 2(1/2), 80–112 (2013)CrossRef
7.
go back to reference Bayardo, R., Agrawal, R., Gunopulos, D.: Constraint-based rule mining in large, dense databases. Data Min. Knowl. Discov. 4, 217–240 (2000)CrossRef Bayardo, R., Agrawal, R., Gunopulos, D.: Constraint-based rule mining in large, dense databases. Data Min. Knowl. Discov. 4, 217–240 (2000)CrossRef
8.
go back to reference Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. JMLR 3, 993–1022 (2003)MATH Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. JMLR 3, 993–1022 (2003)MATH
9.
go back to reference Borji, A., Cheng, M.M., Jiang, H., Li, J.: Salient object detection: a benchmark. IEEE Trans. Image Process. 24(12), 5706–5722 (2015)MathSciNetCrossRef Borji, A., Cheng, M.M., Jiang, H., Li, J.: Salient object detection: a benchmark. IEEE Trans. Image Process. 24(12), 5706–5722 (2015)MathSciNetCrossRef
10.
go back to reference Chrupała, G., Gelderloos, L., Alishahi, A.: Representations of language in a model of visually grounded speech signal. In: Proceedings of ACL, pp. 613–622 (2017) Chrupała, G., Gelderloos, L., Alishahi, A.: Representations of language in a model of visually grounded speech signal. In: Proceedings of ACL, pp. 613–622 (2017)
11.
go back to reference Duivesteijn, W., Feelders, A.J., Knobbe, A.: Exceptional model mining. Data Min. Knowl. Discov. 30(1), 47–98 (2016)MathSciNetCrossRef Duivesteijn, W., Feelders, A.J., Knobbe, A.: Exceptional model mining. Data Min. Knowl. Discov. 30(1), 47–98 (2016)MathSciNetCrossRef
12.
go back to reference Dwork, C., Hardt, M., Pitassi, T., Reingold, O., Zemel, R.S.: Fairness through awareness. CoRR abs/1104.3913 (2011) Dwork, C., Hardt, M., Pitassi, T., Reingold, O., Zemel, R.S.: Fairness through awareness. CoRR abs/1104.3913 (2011)
13.
go back to reference Ganter, B., Wille, R.: Formal concept analysis. Wissenschaftliche Zeitschrift-Technischen Universitat Dresden 45, 8–13 (1996)MATH Ganter, B., Wille, R.: Formal concept analysis. Wissenschaftliche Zeitschrift-Technischen Universitat Dresden 45, 8–13 (1996)MATH
14.
go back to reference Gatt, A., et al.: Face2Text: collecting an annotated image description corpus for the generation of rich face descriptions. In: Proceedings of LREC (2018) Gatt, A., et al.: Face2Text: collecting an annotated image description corpus for the generation of rich face descriptions. In: Proceedings of LREC (2018)
15.
go back to reference Herlitz, A., Lovén, J.: Sex differences and the own-gender bias in face recognition: a meta-analytic review. Visual Cogn. 21(9–10), 1306–1336 (2013)CrossRef Herlitz, A., Lovén, J.: Sex differences and the own-gender bias in face recognition: a meta-analytic review. Visual Cogn. 21(9–10), 1306–1336 (2013)CrossRef
16.
go back to reference Krishna, R., et al.: Visual genome: connecting language and vision using crowdsourced dense image annotations. Int. J. Comput. Vis. 123(1), 32–73 (2017)MathSciNetCrossRef Krishna, R., et al.: Visual genome: connecting language and vision using crowdsourced dense image annotations. Int. J. Comput. Vis. 123(1), 32–73 (2017)MathSciNetCrossRef
19.
go back to reference Levin, D.T.: Race as a visual feature: using visual search and perceptual discrimination tasks to understand face categories and the cross-race recognition deficit. J. Exp. Psychol. Gen. 129(4), 559–574 (2000)CrossRef Levin, D.T.: Race as a visual feature: using visual search and perceptual discrimination tasks to understand face categories and the cross-race recognition deficit. J. Exp. Psychol. Gen. 129(4), 559–574 (2000)CrossRef
21.
go back to reference Minka, T.: Estimating a Dirichlet distribution. Technical report, MIT (2000) Minka, T.: Estimating a Dirichlet distribution. Technical report, MIT (2000)
23.
go back to reference Torralba, A., Efros, A.A.: Unbiased look at dataset bias. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1521–1528. IEEE (2011) Torralba, A., Efros, A.A.: Unbiased look at dataset bias. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1521–1528. IEEE (2011)
Metadata
Title
Identifying Exceptional Descriptions of People Using Topic Modeling and Subgroup Discovery
Authors
Andrew T. Hendrickson
Jason Wang
Martin Atzmueller
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-01851-1_44

Premium Partner