nach oben

Journal of Intelligent Information Systems

Erschienen in:

01.10.2014

Automatic content based image retrieval using semantic analysis

verfasst von: Eugene Santos Jr., Qi Gu

Erschienen in: Journal of Intelligent Information Systems | Ausgabe 2/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

We present a new text-to-image re-ranking approach for improving the relevancy rate in searches. In particular, we focus on the fundamental semantic gap that exists between the low-level visual features of the image and high-level textual queries by dynamically maintaining a connected hierarchy in the form of a concept database. For each textual query, we take the results from popular search engines as an initial retrieval, followed by a semantic analysis to map the textual query to higher level concepts. In order to do this, we design a two-layer scoring system which can identify the relationship between the query and the concepts automatically. We then calculate the image feature vectors and compare them with the classifier for each related concept. An image is relevant only when it is related to the query both semantically and content-wise. The second feature of this work is that we loosen the requirement for query accuracy from the user, which makes it possible to perform well on users’ queries containing less relevant information. Thirdly, the concept database can be dynamically maintained to satisfy the variations in user queries, which eliminates the need for human labor in building a sophisticated initial concept database. We designed our experiment using complex queries (based on five scenarios) to demonstrate how our retrieval results are a significant improvement over those obtained from current state-of-the-art image search engines.

Vorheriger Artikel Learning from syntax generalizations for automatic semantic annotation

Nächster Artikel Image understanding and the web: a state-of-the-art review

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

The image data can be downloaded from: http://labelme.csail.mit.edu/

Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G. (2007). Dbpedia: A nucleus for a web of open data. Proc. International Semantic Web Conference (ISWC) (pp. 11–15).

Banko, M., Cafarella, M.J., Soderland, S., Broadhead, M., Etzioni, O. (2007). In “Open information extraction from the web,”: Proc. International Joint Conference on Artificial Intelligence (IJCAI).

Benitez, A.B., & Chang, S.-F. (2003). Image classification using multimedia knowledge networks. Proc. IEEE International Conference on Image Processing (ICIP) (pp. 613–616).

Blei, D., Ng, A., Jordan, M. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.MATH

Bosch, A., Zisserman, A., Muñoz, X. (2007). Image classification using random forests and ferns. Proc. International Conference Computer Vision (ICCV).

Carbonell, J.G., et al. (1997). Translingual information retrieval: A comparative evaluation. Proc. int’l joint conf. artificial intelligence. Morgan Kaufmann, 708–715.

Carneiro, G., Chan, A.B., Moreno, P.J., Vasconcelos, N. (2006). Supervised learning of semantic classes for image annotation and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(3), 394–410.CrossRef

Carson, C., Belongie, S., Greenspan, H., Malik, J. (2002). Blobworld: image segmentation using expectation-maximization and its application to image querying. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(8), 1026–1038.CrossRef

Chen, Y., & Wang, J.Z. (2002). A region-based fuzzy feature matching approach to content-based image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(9), 252–1267.

Chen, Y.X., & Wang, J.Z. (2004). Image categorization by learning and reasoning with regions. Journal of Machine Learning Research, 5, 913–939 .

Cowling, P.I., Remde, S.M., Hartley, P., Stewart, W., Stock-Brooks, J., Woolley, T. (2010). C-Link: concept linkage in knowledge repositories. AAAI Spring Symposium Series.

Dy, J.G., Brodley, C.E., Kak, A., Broderick, L.S., Aisen, A.M. (2003). Unsupervised feature selection applied to content-based retrieval of lung images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(3), 373–378.CrossRef

Feng, Y., & Lapata, M. (2008). Automatic image annotation using auxiliary text information. Proc. ACL (pp. 272–280). Columbus, Ohio, USA.

Fergus, R., Perona, P., Zisserman, A. (2004). A visual category filter for Google images. Proc. International Conference on Computer Vision ECCV (pp. 242–256).

Gao, Y., & Fan, J.P. (2006). Incorporating concept ontology to enable probabilistic concept reasoning for multi-level image annotation. Proc. 8th ACM international workshop on Multimedia information retrieval (SIGMM) (pp. 79–88).

Gupta, A., Rafatirad, S., Gao, M., Jain, R. (2009). Medialife: from images to a life chronicle. Proc. 35th SIGMOD international conference on Management of data (SIGMOD).

Hauptmann, A.G. (2004). Towards a large scale concept ontology for broadcast video. Proc. ACM International Conference on Image and Video Retrieval (CIVR) (pp. 674–675).

Hsu, W., Kennedy, L., Chang, S.-F. (2007). Novel reranking methods for visual search. IEEE Multimedia, 14(3), 14–22.CrossRef

Hull, D. (1993). Using statistical testing in the evaluation of retrieval experiments. Proc. 16nd annual international ACM SIGIR (pp. 329–338).

Lavrenko, V., Manmatha, R., Jeon, J. (2003). A model for learning the semantics of pictures. Proc. 17th Annual Conference on Neural Information Processing Systems (NIPS).

Li, J., & Wang, J.Z. (2008). Real-time computerized annotation of pictures. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(6), 985–1002.CrossRef

Lin, W.-H., Jin, R., Hauptmann, A. (2003). Web image retrieval re-ranking with relevance model. Proc. IEEE Web Intelligence Consortium (WIC).

Lowe, D. (2004). Distinctive image features from scale-invariant keypoints. Journal of Computer Vision, 2(60), 91–110.CrossRef

Natsev, A., Naphade, M.R., Tesic, J. (2005). Learning the semantics of multimedia queries and concepts from a small number of examples. Proc. ACM Int’l Conf. Multimedia (598–607). ACM Press.

Natsev, A., Rastogi, R., Shim, K. (2004). WALRUS: a similarity retrieval algorithm for image databases. IEEE Transactions on Knowledge and Data Engineering, 16(3), 316.CrossRef

Rasiwasia, N., Costa Pereira, J., Coviello, E., Doyle, G., Lanckriet, G.R.G., Levy, R., Vasconcelos, N. (2010). A new approach to cross-modal multimedia retrieval. ACM Proceedings of the 15th international conference on multimedia. Florence, Italy.

Rasiwasia, N., Moreno, P., Vasconcelos, N. (2007). Bridging the gap: query by semantic example. IEEE Transactions on Multimedia, 9(5), 923–938.CrossRef

Santos, E. Jr, Nguyen, H., Brown, S.M. (2001). Kavanah: an active user interface information retrieval agent technology (pp. 412–423). Japan: Maebashi.

Santos, E. Jr, Santos, E., Nguyen, H., Pan, L., Korah, J., Huadong, X., Yu, F., Li, D. (2008). Analyst-ready large scale real time retrieval tool for e-governance. E-Government Diffusion, Policy and Impact: Advanced Issues and Practices, IGI Global, ISBN: 978-1-60566-130-8.

Schroff, F., Criminisi, A., Zisserman, A. (2007). Harvesting image databases from the web. Proc. IEEE 11th International Conference Computer Vision (ICCV) (pp. 14–21).

Sebe, N., & Lew, M.S. (2003). Comparing salient points detectors. Journal of Pattern Recognition Letters, 24(1–3), 89–96.CrossRefMATH

Snoek, C.G. M., & Smeulders, A.W.M. (2010). Visual-concept search solved?Computer, 43(6), 76–78.CrossRef

Taneva, B., Kacimi, M., Weikum, G. (2010). Gathering and ranking photos of named entities with high precision, high recall, and diversity. Proc. ACM International Conference on Web Search and Data Mining (WSDM).

Tao, D., Tang, X., Li, X., Wu, X. (2006). Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence (pp. 1088–1099).

Tsai, Y.H. (2009). Salient points reduction for content-based image retrieval. World Academy of Science, Engineering and Technology (WASET) (vol. 49).

Üstün, B., Melssen, W.J., L.Buydens, M.C. Facilitating the application of support vector regression by using a universal pearson VII function based kernel. Chemometrics Intel. Lab. Syst (pp. 29–40). Chemometrics Intel.

Veltkamp, R.C., & Tanase, M. (2000). Content-based image retrieval systems: A survey. Technique. Report No. UU-CS-2000-34.

Vu, K., Hua, K.A., Tavanapong, W. (2003). Image retrieval based on regions of interest. IEEE Transactions on Knowledge and Data Engineering, 15(4), 1045–1049.CrossRef

Wang, J.Z., Li, J., Wiederhold, G. (2001). SIMPLIcity: semantics–sensitive integrated matching for picture libraries. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(9), 947–963.CrossRef

Wang, X., Qiu, S., Liu, K., Tang, X. (2013). In Web image re-ranking using query-specific semantic signatures. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI).

Wang, C.H., Zhang, L., Zhang, H.J. (2008). Learning to reduce the semantic gap in web image retrieval and annotation. Proc. 31st annual international ACM SIGIR (pp. 355–362).

Yan, R., Hauptmann, A., Jin, R. (2003). Multimedia search with pseudo-relevance feedback. Proc. Int’l Conf. Image and Video Retrieval, LNCS 2728 (pp. 238–247). Springer.

Zhao, R., & Grosky, W.I. (2002). Negotiating the semantic gap: from feature maps to semantic landscapes. Journal of Pattern Recognition, 35(3), 593–600.CrossRefMATH

Zhou, Z-H., Sun, Y-Y., Li, Y-F. (2009). Multi-instance learning by treating instances as non-i.i.d. samples. Proceedings of the 26th annual international conference on machine learning (pp. 1249–1256).

Zhou, Z-H., & Zhang, M-L. (2007). Multi-instance multi-label learning with application to scene classification. Advances in Neural Information Processing Systems 19 (NIPS’06). Cambridge: MIT press (pp. 1609–1616).

Titel: Automatic content based image retrieval using semantic analysis
verfasst von: Eugene Santos Jr.
Qi Gu
Publikationsdatum: 01.10.2014
Verlag: Springer US
Erschienen in: Journal of Intelligent Information Systems / Ausgabe 2/2014
Print ISSN: 0925-9902
Elektronische ISSN: 1573-7675
DOI: https://doi.org/10.1007/s10844-014-0321-8

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 2/2014

DART+: Direction-aware bichromatic reverse k nearest neighbor query processing in spatial databases

Image understanding and the web: a state-of-the-art review

Improving process models by mining mappings of low-level events to high-level activities

Learning from syntax generalizations for automatic semantic annotation

A rule-based expert system for earthquake prediction

Consistent feature selection and its application to face recognition

Premium Partner