Skip to main content
Erschienen in: Neural Computing and Applications 23/2021

26.06.2021 | Original Article

Topic-based label distribution learning to exploit label ambiguity for scene classification

verfasst von: Jianqiao Luo, Biao He, Yang Ou, Bailin Li, Kai Wang

Erschienen in: Neural Computing and Applications | Ausgabe 23/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

One of the greatest challenges for scene classification is the lack of sufficient training samples. Label distribution learning (LDL) is proven to be effective in handling insufficient samples by exploiting label ambiguity. However, LDL has never been used in scene classification because the correlations among scene classes are unavailable, making it impossible to construct label distribution vectors for images. In this paper, we aim to transform LDL into scene classification. To this end, we introduce a probabilistic topic model (PTM) to capture label correlations, and propose a method termed as topic-based LDL (TB-LDL). By treating scene classes as documents in the PTM, the discovered topics indicate typical scene patterns, and class-topic distributions provide label measurements on multiple topics. For each topic, scenes with similar label measurements can be considered as neighbouring labels. The label distributions smooth image truth labels based on label correlations, which can formulate the label ambiguity of scene images. Training networks with the label distributions can prevent over-fitting and assist feature learning. Extensive experiments on two challenging datasets, namely the aerial image dataset (AID) and NWPU_RESISC45 (NR), demonstrate that our method is effective, especially when the amount of training data is limited.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Cheng G, Han J, Lu X (2017) Remote sensing image scene classification: benchmark and state of the art. Proc IEEE 105:1865–1883CrossRef Cheng G, Han J, Lu X (2017) Remote sensing image scene classification: benchmark and state of the art. Proc IEEE 105:1865–1883CrossRef
24.
32.
Zurück zum Zitat Das P, Xu C, Doell RF, Corso JJ (2013) A thousand frames in just a few words: lingual description of videos through latent topics and sparse object stitching. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition. pp 2634–2641. https://doi.org/10.1109/CVPR.2013.340 Das P, Xu C, Doell RF, Corso JJ (2013) A thousand frames in just a few words: lingual description of videos through latent topics and sparse object stitching. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition. pp 2634–2641. https://​doi.​org/​10.​1109/​CVPR.​2013.​340
34.
Zurück zum Zitat Hessam B, Maxwell H, Mohammad R et al (2018) Label refinery improving ImageNet classification through label progression. In: IEEE computer society conference on computer vision and pattern recognition. arXiv:1805.02641 Hessam B, Maxwell H, Mohammad R et al (2018) Label refinery improving ImageNet classification through label progression. In: IEEE computer society conference on computer vision and pattern recognition. arXiv:​1805.​02641
35.
Zurück zum Zitat Müller R, Kornblith S, Hinton G (2019) When does label smoothing help? In: Neural information processing systems workshops, NIPS 2019, proceedings of the conference. arXiv:1906.02629 Müller R, Kornblith S, Hinton G (2019) When does label smoothing help? In: Neural information processing systems workshops, NIPS 2019, proceedings of the conference. arXiv:​1906.​02629
38.
Zurück zum Zitat Pereyra G, Tucker G, Chorowski J et al (2017) Regularizing neural networks by penalizing confident output distributions. In: 5th international conference on learning representations, ICLR 2017, proceedings of the conference. arXiv:1701.06548 Pereyra G, Tucker G, Chorowski J et al (2017) Regularizing neural networks by penalizing confident output distributions. In: 5th international conference on learning representations, ICLR 2017, proceedings of the conference. arXiv:​1701.​06548
39.
48.
Zurück zum Zitat Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: 3rd international conference on learning representations, ICLR 2015 - conference track proceedings. arXiv: 1409.1556. Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: 3rd international conference on learning representations, ICLR 2015 - conference track proceedings. arXiv: 1409.1556.
49.
Zurück zum Zitat Maaten L.v.d, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579–2625MATH Maaten L.v.d, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579–2625MATH
51.
Zurück zum Zitat Stevens K, Kegelmeyer P, Andrzejewski D, Buttler D (2012) Exploring topic coherence over many models and many topics. In: EMNLP-CoNLL 2012 - 2012 joint conference on empirical methods in natural language processing and computational natural language learning, proceedings of the conference. pp 952–961 Stevens K, Kegelmeyer P, Andrzejewski D, Buttler D (2012) Exploring topic coherence over many models and many topics. In: EMNLP-CoNLL 2012 - 2012 joint conference on empirical methods in natural language processing and computational natural language learning, proceedings of the conference. pp 952–961
59.
Zurück zum Zitat Hinton G, Vinyals O, Dean J. (2015) Distilling the knowledge in a neural network. In: Neural information processing systems workshops, NIPS 2015, proceedings of the conference. arXiv:1503.02531 Hinton G, Vinyals O, Dean J. (2015) Distilling the knowledge in a neural network. In: Neural information processing systems workshops, NIPS 2015, proceedings of the conference. arXiv:​1503.​02531
Metadaten
Titel
Topic-based label distribution learning to exploit label ambiguity for scene classification
verfasst von
Jianqiao Luo
Biao He
Yang Ou
Bailin Li
Kai Wang
Publikationsdatum
26.06.2021
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 23/2021
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-021-06218-w

Weitere Artikel der Ausgabe 23/2021

Neural Computing and Applications 23/2021 Zur Ausgabe

Premium Partner