2014 | OriginalPaper | Book chapter

Research on Middle-Semantic Manifold Object Annotation

Authors: Wengang Feng, Shaozhong Wu

Published in: Foundations and Practical Applications of Cognitive Systems and Information Processing

Publisher: Springer Berlin Heidelberg

Abstract

This paper presents a novel bionic, middle-semantic object annotation framework whose model is built on perception as defined by the human visual system. First, each image is represented by super-pixels, and a conditional random field labels every super-pixel, which annotates the different classes of objects. Next, building on that result, an image pyramid is used to represent the image and to obtain sub-regions of objects belonging to the same class. After descriptors are extracted to represent the patches, all patches are projected onto a manifold, which annotates the different views of objects from the same class. Experiments show that the framework obtains superior results with respect to accuracy and that it indirectly verifies the correctness of WordNet.
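
The abstract does not say how the image pyramid or the patches are constructed, so the following is only a minimal sketch of that one step under stated assumptions: the pyramid halves the image by averaging 2x2 blocks, and patches are fixed-size square windows taken at a fixed stride. The names `downsample`, `build_pyramid`, `extract_patches`, and the parameters `min_size`, `size`, and `stride` are hypothetical and not taken from the paper.

```python
from typing import List

Image = List[List[float]]  # grayscale image as rows of pixel intensities


def downsample(img: Image) -> Image:
    """Halve each dimension by averaging non-overlapping 2x2 blocks (assumed scheme)."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [
        [
            (img[2 * r][2 * c] + img[2 * r][2 * c + 1]
             + img[2 * r + 1][2 * c] + img[2 * r + 1][2 * c + 1]) / 4.0
            for c in range(w)
        ]
        for r in range(h)
    ]


def build_pyramid(img: Image, min_size: int) -> List[Image]:
    """Return levels from full resolution down to the last one with side >= min_size."""
    levels = [img]
    while min(len(levels[-1]), len(levels[-1][0])) // 2 >= min_size:
        levels.append(downsample(levels[-1]))
    return levels


def extract_patches(img: Image, size: int, stride: int) -> List[Image]:
    """Collect size x size windows, stepping by stride in both directions."""
    patches = []
    for r in range(0, len(img) - size + 1, stride):
        for c in range(0, len(img[0]) - size + 1, stride):
            patches.append([row[c:c + size] for row in img[r:r + size]])
    return patches


# Toy 8x8 image whose pixel value encodes its position.
image = [[float(r * 8 + c) for c in range(8)] for r in range(8)]
pyramid = build_pyramid(image, min_size=2)
patches_per_level = [extract_patches(level, size=2, stride=2) for level in pyramid]
```

In a full pipeline of the kind the abstract outlines, each patch would then be turned into a descriptor and the descriptors embedded on a manifold; both of those stages are omitted here because the abstract gives no details about them.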

Metadata
Title
Research on Middle-Semantic Manifold Object Annotation
Authors
Wengang Feng
Shaozhong Wu
Copyright year
2014
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-642-37835-5_20