2015 | Original Paper | Book chapter

An LDA Topic Model Adaptation for Context-Based Image Retrieval

Authors: Hatem Aouadi, Mouna Torjmen Khemakhem, Maher Ben Jemaa

Published in: E-Commerce and Web Technologies

Publisher: Springer International Publishing

Abstract

In context-based image retrieval, the textual information surrounding an image plays a central role in ranking the returned results. Although this technique outperforms content-based approaches, it may fail when the query keywords do not match the textual content of many documents containing relevant images. In addition, users are usually not experts and provide ambiguous queries that lead to heterogeneous results. To address these problems, researchers try to re-rank the initial results using other techniques such as query expansion and concept-based retrieval. In this paper, we propose using the LDA topic model to re-rank results and thereby improve retrieval precision. We apply this model at two levels: a global level, represented by the whole document containing the image, and a local level, represented by the paragraph containing the image (treated as textual information specific to that image). Results show a significant improvement over the standard text retrieval approach when re-ranking with the LDA model applied at the local level.
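The abstract does not give the scoring formula, so the following is only a minimal sketch of what topic-based re-ranking at the two levels could look like, not the authors' method. It makes several assumptions: LDA is fitted with a small collapsed Gibbs sampler, the query is treated as one more pseudo-document, and the final score is a linear mix of the normalized initial retrieval score and the cosine similarity between the query's and the text unit's topic distributions. The names `fit_lda`, `rerank`, `weight`, and the sample records are hypothetical.

```python
import math
import random


def fit_lda(docs, num_topics, iters=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA; returns one topic distribution per document."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)
    doc_topic = [[0] * num_topics for _ in docs]
    topic_word = [[0] * vocab_size for _ in range(num_topics)]
    topic_total = [0] * num_topics
    assignments = []
    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_d)
    # Resample each token's topic from its conditional distribution.
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v, k = word_id[w], assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                weights = [
                    (doc_topic[d][t] + alpha) * (topic_word[t][v] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1
    # Smoothed per-document topic proportions (each row sums to 1).
    return [
        [(doc_topic[d][t] + alpha) / (len(doc) + num_topics * alpha)
         for t in range(num_topics)]
        for d, doc in enumerate(docs)
    ]


def cosine(p, q):
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0


def rerank(query, results, level="local", num_topics=2, weight=0.5):
    """Re-rank results using the paragraph (local) or whole document (global) text."""
    key = "paragraph" if level == "local" else "document"
    texts = [r[key].lower().split() for r in results]
    # Assumption: the query is appended as a pseudo-document so it gets a topic vector.
    thetas = fit_lda(texts + [query.lower().split()], num_topics)
    query_theta = thetas[-1]
    top = max(r["score"] for r in results) or 1.0
    scored = [
        ((1 - weight) * r["score"] / top + weight * cosine(query_theta, theta), r["image"])
        for r, theta in zip(results, thetas)
    ]
    return [image for _, image in sorted(scored, key=lambda s: s[0], reverse=True)]


# Hypothetical initial results from a standard text retrieval run.
results = [
    {"image": "img1.jpg", "score": 2.1,
     "document": "jaguar habitat rainforest prey cat zoo visitors car park",
     "paragraph": "jaguar cat rainforest prey"},
    {"image": "img2.jpg", "score": 1.8,
     "document": "jaguar car engine speed dealer price road test",
     "paragraph": "jaguar car engine speed"},
    {"image": "img3.jpg", "score": 1.2,
     "document": "car engine road test dealer price review",
     "paragraph": "car dealer price"},
]

print(rerank("jaguar car", results, level="local"))
print(rerank("jaguar car", results, level="global"))
```

Switching `level` changes only which text is fed to the topic model, which mirrors the global/local distinction in the abstract. On a corpus this small the inferred topics are not meaningful; a real run would fit the model on the full collection.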

Metadata
Title
An LDA Topic Model Adaptation for Context-Based Image Retrieval
Authors
Hatem Aouadi
Mouna Torjmen Khemakhem
Maher Ben Jemaa
Copyright year
2015
DOI
https://doi.org/10.1007/978-3-319-27729-5_6
