Skip to main content
Erschienen in: Journal of Intelligent Information Systems 2/2014

01.10.2014

Automatic content based image retrieval using semantic analysis

verfasst von: Eugene Santos Jr., Qi Gu

Erschienen in: Journal of Intelligent Information Systems | Ausgabe 2/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We present a new text-to-image re-ranking approach for improving the relevancy rate in searches. In particular, we focus on the fundamental semantic gap that exists between the low-level visual features of the image and high-level textual queries by dynamically maintaining a connected hierarchy in the form of a concept database. For each textual query, we take the results from popular search engines as an initial retrieval, followed by a semantic analysis to map the textual query to higher level concepts. In order to do this, we design a two-layer scoring system which can identify the relationship between the query and the concepts automatically. We then calculate the image feature vectors and compare them with the classifier for each related concept. An image is relevant only when it is related to the query both semantically and content-wise. The second feature of this work is that we loosen the requirement for query accuracy from the user, which makes it possible to perform well on users’ queries containing less relevant information. Thirdly, the concept database can be dynamically maintained to satisfy the variations in user queries, which eliminates the need for human labor in building a sophisticated initial concept database. We designed our experiment using complex queries (based on five scenarios) to demonstrate how our retrieval results are a significant improvement over those obtained from current state-of-the-art image search engines.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
The image data can be downloaded from: http://​labelme.​csail.​mit.​edu/​
 
Literatur
Zurück zum Zitat Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G. (2007). Dbpedia: A nucleus for a web of open data. Proc. International Semantic Web Conference (ISWC) (pp. 11–15). Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G. (2007). Dbpedia: A nucleus for a web of open data. Proc. International Semantic Web Conference (ISWC) (pp. 11–15).
Zurück zum Zitat Banko, M., Cafarella, M.J., Soderland, S., Broadhead, M., Etzioni, O. (2007). In “Open information extraction from the web,”: Proc. International Joint Conference on Artificial Intelligence (IJCAI). Banko, M., Cafarella, M.J., Soderland, S., Broadhead, M., Etzioni, O. (2007). In “Open information extraction from the web,”: Proc. International Joint Conference on Artificial Intelligence (IJCAI).
Zurück zum Zitat Benitez, A.B., & Chang, S.-F. (2003). Image classification using multimedia knowledge networks. Proc. IEEE International Conference on Image Processing (ICIP) (pp. 613–616). Benitez, A.B., & Chang, S.-F. (2003). Image classification using multimedia knowledge networks. Proc. IEEE International Conference on Image Processing (ICIP) (pp. 613–616).
Zurück zum Zitat Blei, D., Ng, A., Jordan, M. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.MATH Blei, D., Ng, A., Jordan, M. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.MATH
Zurück zum Zitat Bosch, A., Zisserman, A., Muñoz, X. (2007). Image classification using random forests and ferns. Proc. International Conference Computer Vision (ICCV). Bosch, A., Zisserman, A., Muñoz, X. (2007). Image classification using random forests and ferns. Proc. International Conference Computer Vision (ICCV).
Zurück zum Zitat Carbonell, J.G., et al. (1997). Translingual information retrieval: A comparative evaluation. Proc. int’l joint conf. artificial intelligence. Morgan Kaufmann, 708–715. Carbonell, J.G., et al. (1997). Translingual information retrieval: A comparative evaluation. Proc. int’l joint conf. artificial intelligence. Morgan Kaufmann, 708–715.
Zurück zum Zitat Carneiro, G., Chan, A.B., Moreno, P.J., Vasconcelos, N. (2006). Supervised learning of semantic classes for image annotation and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(3), 394–410.CrossRef Carneiro, G., Chan, A.B., Moreno, P.J., Vasconcelos, N. (2006). Supervised learning of semantic classes for image annotation and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(3), 394–410.CrossRef
Zurück zum Zitat Carson, C., Belongie, S., Greenspan, H., Malik, J. (2002). Blobworld: image segmentation using expectation-maximization and its application to image querying. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(8), 1026–1038.CrossRef Carson, C., Belongie, S., Greenspan, H., Malik, J. (2002). Blobworld: image segmentation using expectation-maximization and its application to image querying. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(8), 1026–1038.CrossRef
Zurück zum Zitat Chen, Y., & Wang, J.Z. (2002). A region-based fuzzy feature matching approach to content-based image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(9), 252–1267. Chen, Y., & Wang, J.Z. (2002). A region-based fuzzy feature matching approach to content-based image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(9), 252–1267.
Zurück zum Zitat Chen, Y.X., & Wang, J.Z. (2004). Image categorization by learning and reasoning with regions. Journal of Machine Learning Research, 5, 913–939 . Chen, Y.X., & Wang, J.Z. (2004). Image categorization by learning and reasoning with regions. Journal of Machine Learning Research, 5, 913–939 .
Zurück zum Zitat Cowling, P.I., Remde, S.M., Hartley, P., Stewart, W., Stock-Brooks, J., Woolley, T. (2010). C-Link: concept linkage in knowledge repositories. AAAI Spring Symposium Series. Cowling, P.I., Remde, S.M., Hartley, P., Stewart, W., Stock-Brooks, J., Woolley, T. (2010). C-Link: concept linkage in knowledge repositories. AAAI Spring Symposium Series.
Zurück zum Zitat Dy, J.G., Brodley, C.E., Kak, A., Broderick, L.S., Aisen, A.M. (2003). Unsupervised feature selection applied to content-based retrieval of lung images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(3), 373–378.CrossRef Dy, J.G., Brodley, C.E., Kak, A., Broderick, L.S., Aisen, A.M. (2003). Unsupervised feature selection applied to content-based retrieval of lung images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(3), 373–378.CrossRef
Zurück zum Zitat Feng, Y., & Lapata, M. (2008). Automatic image annotation using auxiliary text information. Proc. ACL (pp. 272–280). Columbus, Ohio, USA. Feng, Y., & Lapata, M. (2008). Automatic image annotation using auxiliary text information. Proc. ACL (pp. 272–280). Columbus, Ohio, USA.
Zurück zum Zitat Fergus, R., Perona, P., Zisserman, A. (2004). A visual category filter for Google images. Proc. International Conference on Computer Vision ECCV (pp. 242–256). Fergus, R., Perona, P., Zisserman, A. (2004). A visual category filter for Google images. Proc. International Conference on Computer Vision ECCV (pp. 242–256).
Zurück zum Zitat Gao, Y., & Fan, J.P. (2006). Incorporating concept ontology to enable probabilistic concept reasoning for multi-level image annotation. Proc. 8th ACM international workshop on Multimedia information retrieval (SIGMM) (pp. 79–88). Gao, Y., & Fan, J.P. (2006). Incorporating concept ontology to enable probabilistic concept reasoning for multi-level image annotation. Proc. 8th ACM international workshop on Multimedia information retrieval (SIGMM) (pp. 79–88).
Zurück zum Zitat Gupta, A., Rafatirad, S., Gao, M., Jain, R. (2009). Medialife: from images to a life chronicle. Proc. 35th SIGMOD international conference on Management of data (SIGMOD). Gupta, A., Rafatirad, S., Gao, M., Jain, R. (2009). Medialife: from images to a life chronicle. Proc. 35th SIGMOD international conference on Management of data (SIGMOD).
Zurück zum Zitat Hauptmann, A.G. (2004). Towards a large scale concept ontology for broadcast video. Proc. ACM International Conference on Image and Video Retrieval (CIVR) (pp. 674–675). Hauptmann, A.G. (2004). Towards a large scale concept ontology for broadcast video. Proc. ACM International Conference on Image and Video Retrieval (CIVR) (pp. 674–675).
Zurück zum Zitat Hsu, W., Kennedy, L., Chang, S.-F. (2007). Novel reranking methods for visual search. IEEE Multimedia, 14(3), 14–22.CrossRef Hsu, W., Kennedy, L., Chang, S.-F. (2007). Novel reranking methods for visual search. IEEE Multimedia, 14(3), 14–22.CrossRef
Zurück zum Zitat Hull, D. (1993). Using statistical testing in the evaluation of retrieval experiments. Proc. 16nd annual international ACM SIGIR (pp. 329–338). Hull, D. (1993). Using statistical testing in the evaluation of retrieval experiments. Proc. 16nd annual international ACM SIGIR (pp. 329–338).
Zurück zum Zitat Lavrenko, V., Manmatha, R., Jeon, J. (2003). A model for learning the semantics of pictures. Proc. 17th Annual Conference on Neural Information Processing Systems (NIPS). Lavrenko, V., Manmatha, R., Jeon, J. (2003). A model for learning the semantics of pictures. Proc. 17th Annual Conference on Neural Information Processing Systems (NIPS).
Zurück zum Zitat Li, J., & Wang, J.Z. (2008). Real-time computerized annotation of pictures. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(6), 985–1002.CrossRef Li, J., & Wang, J.Z. (2008). Real-time computerized annotation of pictures. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(6), 985–1002.CrossRef
Zurück zum Zitat Lin, W.-H., Jin, R., Hauptmann, A. (2003). Web image retrieval re-ranking with relevance model. Proc. IEEE Web Intelligence Consortium (WIC). Lin, W.-H., Jin, R., Hauptmann, A. (2003). Web image retrieval re-ranking with relevance model. Proc. IEEE Web Intelligence Consortium (WIC).
Zurück zum Zitat Lowe, D. (2004). Distinctive image features from scale-invariant keypoints. Journal of Computer Vision, 2(60), 91–110.CrossRef Lowe, D. (2004). Distinctive image features from scale-invariant keypoints. Journal of Computer Vision, 2(60), 91–110.CrossRef
Zurück zum Zitat Natsev, A., Naphade, M.R., Tesic, J. (2005). Learning the semantics of multimedia queries and concepts from a small number of examples. Proc. ACM Int’l Conf. Multimedia (598–607). ACM Press. Natsev, A., Naphade, M.R., Tesic, J. (2005). Learning the semantics of multimedia queries and concepts from a small number of examples. Proc. ACM Int’l Conf. Multimedia (598–607). ACM Press.
Zurück zum Zitat Natsev, A., Rastogi, R., Shim, K. (2004). WALRUS: a similarity retrieval algorithm for image databases. IEEE Transactions on Knowledge and Data Engineering, 16(3), 316.CrossRef Natsev, A., Rastogi, R., Shim, K. (2004). WALRUS: a similarity retrieval algorithm for image databases. IEEE Transactions on Knowledge and Data Engineering, 16(3), 316.CrossRef
Zurück zum Zitat Rasiwasia, N., Costa Pereira, J., Coviello, E., Doyle, G., Lanckriet, G.R.G., Levy, R., Vasconcelos, N. (2010). A new approach to cross-modal multimedia retrieval. ACM Proceedings of the 15th international conference on multimedia. Florence, Italy. Rasiwasia, N., Costa Pereira, J., Coviello, E., Doyle, G., Lanckriet, G.R.G., Levy, R., Vasconcelos, N. (2010). A new approach to cross-modal multimedia retrieval. ACM Proceedings of the 15th international conference on multimedia. Florence, Italy.
Zurück zum Zitat Rasiwasia, N., Moreno, P., Vasconcelos, N. (2007). Bridging the gap: query by semantic example. IEEE Transactions on Multimedia, 9(5), 923–938.CrossRef Rasiwasia, N., Moreno, P., Vasconcelos, N. (2007). Bridging the gap: query by semantic example. IEEE Transactions on Multimedia, 9(5), 923–938.CrossRef
Zurück zum Zitat Santos, E. Jr, Nguyen, H., Brown, S.M. (2001). Kavanah: an active user interface information retrieval agent technology (pp. 412–423). Japan: Maebashi. Santos, E. Jr, Nguyen, H., Brown, S.M. (2001). Kavanah: an active user interface information retrieval agent technology (pp. 412–423). Japan: Maebashi.
Zurück zum Zitat Santos, E. Jr, Santos, E., Nguyen, H., Pan, L., Korah, J., Huadong, X., Yu, F., Li, D. (2008). Analyst-ready large scale real time retrieval tool for e-governance. E-Government Diffusion, Policy and Impact: Advanced Issues and Practices, IGI Global, ISBN: 978-1-60566-130-8. Santos, E. Jr, Santos, E., Nguyen, H., Pan, L., Korah, J., Huadong, X., Yu, F., Li, D. (2008). Analyst-ready large scale real time retrieval tool for e-governance. E-Government Diffusion, Policy and Impact: Advanced Issues and Practices, IGI Global, ISBN: 978-1-60566-130-8.
Zurück zum Zitat Schroff, F., Criminisi, A., Zisserman, A. (2007). Harvesting image databases from the web. Proc. IEEE 11th International Conference Computer Vision (ICCV) (pp. 14–21). Schroff, F., Criminisi, A., Zisserman, A. (2007). Harvesting image databases from the web. Proc. IEEE 11th International Conference Computer Vision (ICCV) (pp. 14–21).
Zurück zum Zitat Sebe, N., & Lew, M.S. (2003). Comparing salient points detectors. Journal of Pattern Recognition Letters, 24(1–3), 89–96.CrossRefMATH Sebe, N., & Lew, M.S. (2003). Comparing salient points detectors. Journal of Pattern Recognition Letters, 24(1–3), 89–96.CrossRefMATH
Zurück zum Zitat Snoek, C.G. M., & Smeulders, A.W.M. (2010). Visual-concept search solved?Computer, 43(6), 76–78.CrossRef Snoek, C.G. M., & Smeulders, A.W.M. (2010). Visual-concept search solved?Computer, 43(6), 76–78.CrossRef
Zurück zum Zitat Taneva, B., Kacimi, M., Weikum, G. (2010). Gathering and ranking photos of named entities with high precision, high recall, and diversity. Proc. ACM International Conference on Web Search and Data Mining (WSDM). Taneva, B., Kacimi, M., Weikum, G. (2010). Gathering and ranking photos of named entities with high precision, high recall, and diversity. Proc. ACM International Conference on Web Search and Data Mining (WSDM).
Zurück zum Zitat Tao, D., Tang, X., Li, X., Wu, X. (2006). Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence (pp. 1088–1099). Tao, D., Tang, X., Li, X., Wu, X. (2006). Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence (pp. 1088–1099).
Zurück zum Zitat Tsai, Y.H. (2009). Salient points reduction for content-based image retrieval. World Academy of Science, Engineering and Technology (WASET) (vol. 49). Tsai, Y.H. (2009). Salient points reduction for content-based image retrieval. World Academy of Science, Engineering and Technology (WASET) (vol. 49).
Zurück zum Zitat Üstün, B., Melssen, W.J., L.Buydens, M.C. Facilitating the application of support vector regression by using a universal pearson VII function based kernel. Chemometrics Intel. Lab. Syst (pp. 29–40). Chemometrics Intel. Üstün, B., Melssen, W.J., L.Buydens, M.C. Facilitating the application of support vector regression by using a universal pearson VII function based kernel. Chemometrics Intel. Lab. Syst (pp. 29–40). Chemometrics Intel.
Zurück zum Zitat Veltkamp, R.C., & Tanase, M. (2000). Content-based image retrieval systems: A survey. Technique. Report No. UU-CS-2000-34. Veltkamp, R.C., & Tanase, M. (2000). Content-based image retrieval systems: A survey. Technique. Report No. UU-CS-2000-34.
Zurück zum Zitat Vu, K., Hua, K.A., Tavanapong, W. (2003). Image retrieval based on regions of interest. IEEE Transactions on Knowledge and Data Engineering, 15(4), 1045–1049.CrossRef Vu, K., Hua, K.A., Tavanapong, W. (2003). Image retrieval based on regions of interest. IEEE Transactions on Knowledge and Data Engineering, 15(4), 1045–1049.CrossRef
Zurück zum Zitat Wang, J.Z., Li, J., Wiederhold, G. (2001). SIMPLIcity: semantics–sensitive integrated matching for picture libraries. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(9), 947–963.CrossRef Wang, J.Z., Li, J., Wiederhold, G. (2001). SIMPLIcity: semantics–sensitive integrated matching for picture libraries. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(9), 947–963.CrossRef
Zurück zum Zitat Wang, X., Qiu, S., Liu, K., Tang, X. (2013). In Web image re-ranking using query-specific semantic signatures. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI). Wang, X., Qiu, S., Liu, K., Tang, X. (2013). In Web image re-ranking using query-specific semantic signatures. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI).
Zurück zum Zitat Wang, C.H., Zhang, L., Zhang, H.J. (2008). Learning to reduce the semantic gap in web image retrieval and annotation. Proc. 31st annual international ACM SIGIR (pp. 355–362). Wang, C.H., Zhang, L., Zhang, H.J. (2008). Learning to reduce the semantic gap in web image retrieval and annotation. Proc. 31st annual international ACM SIGIR (pp. 355–362).
Zurück zum Zitat Yan, R., Hauptmann, A., Jin, R. (2003). Multimedia search with pseudo-relevance feedback. Proc. Int’l Conf. Image and Video Retrieval, LNCS 2728 (pp. 238–247). Springer. Yan, R., Hauptmann, A., Jin, R. (2003). Multimedia search with pseudo-relevance feedback. Proc. Int’l Conf. Image and Video Retrieval, LNCS 2728 (pp. 238–247). Springer.
Zurück zum Zitat Zhao, R., & Grosky, W.I. (2002). Negotiating the semantic gap: from feature maps to semantic landscapes. Journal of Pattern Recognition, 35(3), 593–600.CrossRefMATH Zhao, R., & Grosky, W.I. (2002). Negotiating the semantic gap: from feature maps to semantic landscapes. Journal of Pattern Recognition, 35(3), 593–600.CrossRefMATH
Zurück zum Zitat Zhou, Z-H., Sun, Y-Y., Li, Y-F. (2009). Multi-instance learning by treating instances as non-i.i.d. samples. Proceedings of the 26th annual international conference on machine learning (pp. 1249–1256). Zhou, Z-H., Sun, Y-Y., Li, Y-F. (2009). Multi-instance learning by treating instances as non-i.i.d. samples. Proceedings of the 26th annual international conference on machine learning (pp. 1249–1256).
Zurück zum Zitat Zhou, Z-H., & Zhang, M-L. (2007). Multi-instance multi-label learning with application to scene classification. Advances in Neural Information Processing Systems 19 (NIPS’06). Cambridge: MIT press (pp. 1609–1616). Zhou, Z-H., & Zhang, M-L. (2007). Multi-instance multi-label learning with application to scene classification. Advances in Neural Information Processing Systems 19 (NIPS’06). Cambridge: MIT press (pp. 1609–1616).
Metadaten
Titel
Automatic content based image retrieval using semantic analysis
verfasst von
Eugene Santos Jr.
Qi Gu
Publikationsdatum
01.10.2014
Verlag
Springer US
Erschienen in
Journal of Intelligent Information Systems / Ausgabe 2/2014
Print ISSN: 0925-9902
Elektronische ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-014-0321-8

Weitere Artikel der Ausgabe 2/2014

Journal of Intelligent Information Systems 2/2014 Zur Ausgabe

Premium Partner