Skip to main content

2014 | OriginalPaper | Buchkapitel

A Study into Annotation Ranking Metrics in Community Contributed Image Corpora

verfasst von : Mark Hughes, Gareth J. F. Jones, Noel E. O’Connor

Erschienen in: Adaptive Multimedia Retrieval: Semantics, Context, and Adaptation

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Community contributed datasets are becoming increasing common in automated image annotation systems. One important issue with community image data is that there is no guarantee that the associated metadata is relevant. A method is required that can accurately rank the semantic relevance of community annotations. This should enable the extracting of relevant subsets from potentially noisy collections of these annotations. Having relevant, non-heterogeneous tags assigned to images should improve community image retrieval systems, such as Flickr, which are based on text retrieval methods. In the literature, the current state of the art approach to ranking the semantic relevance of Flickr tags is based on the widely used tf-idf metric. In the case of datasets containing landmark images, however, this metric is inefficient and can be improved upon. In this paper, we present a landmark recognition framework, that provides end-to-end automated recognition and annotation. In our study into automated annotation, we evaluate 5 alternate approaches to tf-idf to rank tag relevance in community contributed landmark image corpora. We carry out a thorough evaluation of each of these ranking metrics and results of this evaluation demonstrate that four of these proposed techniques outperform the current commonly-used tf-idf approach for this task. Our best performing evaluated approach achieves a significant F-Measure increase of .19 over tf-idf.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
Literatur
1.
Zurück zum Zitat Kennedy, L., Naaman, M., Ahern, S., Nair, R., Rattenbury, T.: How flickr helps us make sense of the world: context and content in community-contributed media collections. In: MULTIMEDIA ’07: Proceedings of the 15th international conference on Multimedia, pp. 631–640 (2007) Kennedy, L., Naaman, M., Ahern, S., Nair, R., Rattenbury, T.: How flickr helps us make sense of the world: context and content in community-contributed media collections. In: MULTIMEDIA ’07: Proceedings of the 15th international conference on Multimedia, pp. 631–640 (2007)
2.
Zurück zum Zitat Kennedy, L., Naaman, M.: Generating diverse and representative image search results for landmarks. In: WWW ’08: Proceeding of the 17th international conference on World Wide Web, pp. 297–306 (2008) Kennedy, L., Naaman, M.: Generating diverse and representative image search results for landmarks. In: WWW ’08: Proceeding of the 17th international conference on World Wide Web, pp. 297–306 (2008)
3.
Zurück zum Zitat Ahern, S., Naaman, M., Nair, R., Yang, J.: World explorer: visualizing aggregate data from unstructured text in geo-referenced collections. In: Proceedings of the Seventh ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 1–10 (2007) Ahern, S., Naaman, M., Nair, R., Yang, J.: World explorer: visualizing aggregate data from unstructured text in geo-referenced collections. In: Proceedings of the Seventh ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 1–10 (2007)
4.
Zurück zum Zitat Xirong, L., Snoek, C., Worring, M.: Annotating images by harnessing worldwide user-tagged photos. In: Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 3717–3720 (2009) Xirong, L., Snoek, C., Worring, M.: Annotating images by harnessing worldwide user-tagged photos. In: Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 3717–3720 (2009)
5.
Zurück zum Zitat Mahapatra, A., Wan, X., Tian, Y., Srivastava, J.: Augmenting image processing with social tag mining for landmark recognition. In: Lee, K.-T., Tsai, W.-H., Liao, H.-Y.M., Chen, T., Hsieh, J.-W., Tseng, C.-C. (eds.) MMM 2011 Part I. LNCS, vol. 6523, pp. 273–283. Springer, Heidelberg (2011)CrossRef Mahapatra, A., Wan, X., Tian, Y., Srivastava, J.: Augmenting image processing with social tag mining for landmark recognition. In: Lee, K.-T., Tsai, W.-H., Liao, H.-Y.M., Chen, T., Hsieh, J.-W., Tseng, C.-C. (eds.) MMM 2011 Part I. LNCS, vol. 6523, pp. 273–283. Springer, Heidelberg (2011)CrossRef
6.
Zurück zum Zitat Sigurbornsson, B., Van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: WWW ’08: Proceeding of the 17th International Conference on World Wide Web, pp. 327–336 (2008) Sigurbornsson, B., Van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: WWW ’08: Proceeding of the 17th International Conference on World Wide Web, pp. 327–336 (2008)
7.
Zurück zum Zitat Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006)CrossRef Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006)CrossRef
8.
Zurück zum Zitat Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2161–2168 (2006) Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2161–2168 (2006)
9.
Zurück zum Zitat Sivic, J., Zisserman, A.: DVideo Google: a text retrieval approach to object matching in videos. In: Ninth IEEE International Conference on Computer Vision 2003, Proceedings, pp. 1470–1477 (2003) Sivic, J., Zisserman, A.: DVideo Google: a text retrieval approach to object matching in videos. In: Ninth IEEE International Conference on Computer Vision 2003, Proceedings, pp. 1470–1477 (2003)
10.
Zurück zum Zitat Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)CrossRef Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)CrossRef
11.
Zurück zum Zitat Girardin, F., Blat, J.: Place this photo on a map: a study of explicit disclosure of location information. In: UbiComp (2007) Girardin, F., Blat, J.: Place this photo on a map: a study of explicit disclosure of location information. In: UbiComp (2007)
12.
Zurück zum Zitat Hollenstein, L.: Capturing vernacular geography from georeferenced tags. Masters thesis, University of Zurich (2008) Hollenstein, L.: Capturing vernacular geography from georeferenced tags. Masters thesis, University of Zurich (2008)
Metadaten
Titel
A Study into Annotation Ranking Metrics in Community Contributed Image Corpora
verfasst von
Mark Hughes
Gareth J. F. Jones
Noel E. O’Connor
Copyright-Jahr
2014
DOI
https://doi.org/10.1007/978-3-319-12093-5_8

Neuer Inhalt