2018 | OriginalPaper | Book chapter

Image Annotation Using a Semantic Hierarchy

Written by: Abdessalem Bouzaieni, Salvatore Tabbone

Published in: Structural, Syntactic, and Statistical Pattern Recognition

Publisher: Springer International Publishing

Abstract

With the rapid development of smartphones and image sharing on social media, automatic image annotation has become a research area of great interest. It makes indexing, extracting, and searching large collections of images easier and faster. In this paper, we propose a model for extending the annotations of images using a semantic hierarchy. The latter is built from the annotation keywords of the vocabulary, combining a mixture of Bernoulli distributions with mixtures of Gaussians.
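To make the two building blocks named in the abstract concrete, here is a minimal sketch of the standard textbook definitions: a Bernoulli mixture over a binary keyword vector and a Gaussian mixture (diagonal covariance) over a real-valued feature vector. The abstract does not say how the paper combines or trains them, so the function names, the diagonal-covariance choice, and the toy parameters are illustrative assumptions, not the authors' model.

```python
import math


def bernoulli_mixture_pmf(x, weights, probs):
    """Probability of a binary vector x under a mixture of Bernoulli components.

    weights[k] is the mixing weight of component k; probs[k][j] is the
    probability that keyword j is present under component k.
    """
    total = 0.0
    for w, p in zip(weights, probs):
        comp = 1.0
        for xj, pj in zip(x, p):
            comp *= pj if xj else (1.0 - pj)
        total += w * comp
    return total


def gaussian_mixture_pdf(v, weights, means, variances):
    """Density of a real vector v under a Gaussian mixture.

    Assumption for this sketch: each component has a diagonal covariance,
    given as a list of per-dimension variances.
    """
    total = 0.0
    for w, mu, var in zip(weights, means, variances):
        comp = 1.0
        for vd, md, sd2 in zip(v, mu, var):
            comp *= math.exp(-((vd - md) ** 2) / (2.0 * sd2)) / math.sqrt(2.0 * math.pi * sd2)
        total += w * comp
    return total


# Toy example (hypothetical numbers): two keywords, two mixture components.
keyword_prob = bernoulli_mixture_pmf(
    [1, 0], weights=[0.5, 0.5], probs=[[0.9, 0.1], [0.2, 0.8]]
)
feature_density = gaussian_mixture_pdf(
    [0.0], weights=[1.0], means=[[0.0]], variances=[[1.0]]
)
```

In practice the parameters of both mixtures would be estimated from data, typically with the EM algorithm; the sketch only evaluates the densities for given parameters.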

Metadata
Title
Image Annotation Using a Semantic Hierarchy
Written by
Abdessalem Bouzaieni
Salvatore Tabbone
Copyright year
2018
DOI
https://doi.org/10.1007/978-3-319-97785-0_1