Skip to main content
Erschienen in: Multimedia Systems 6/2014

01.11.2014 | Regular Paper

Sparse semantic metric learning for image retrieval

verfasst von: Jing Liu, Zechao Li, Hanqing Lu

Erschienen in: Multimedia Systems | Ausgabe 6/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Typical content-based image retrieval solutions usually cannot achieve satisfactory performance due to the semantic gap challenge. With the popularity of social media applications, large amounts of social images associated with user tagging information are available, which can be leveraged to boost image retrieval. In this paper, we propose a sparse semantic metric learning (SSML) algorithm by discovering knowledge from these social media resources, and apply the learned metric to search relevant images for users. Different from the traditional metric learning approaches that use similar or dissimilar constraints over a homogeneous visual space, the proposed method exploits heterogeneous information from two views of images and formulates the learning problem with the following principles. The semantic structure in the text space is expected to be preserved for the transformed space. To prevent overfitting the noisy, incomplete, or subjective tagging information of images, we expect that the mapping space by the learned metric does not deviate from the original visual space. In addition, the metric is straightforward constrained to be row-wise sparse with the ℓ2,1-norm to suppress certain noisy or redundant visual feature dimensions. We present an iterative algorithm with proved convergence to solve the optimization problem. With the learned metric for image retrieval, we conduct extensive experiments on a real-world dataset and validate the effectiveness of our approach compared with other related work.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
4
In practice, \(\|{\bf m}^{l}\|_2\) could be close to zero but not zero. Theoretically, it could be zeros. For this case, we can regularize \(D_{ll}=\frac{1}{2\sqrt{\|{\bf m}^{l}\|_2^2+\epsilon}}, \) where \(\epsilon\) is very small constant. When \(\epsilon\rightarrow 0, \) we can see that \(\frac{1}{2\sqrt{\|{\bf m}^{l}\|_2^2+\epsilon}}\) approximates \(\frac{1}{2\|{\bf m}^{l}\|_2}. \)
 
Literatur
1.
Zurück zum Zitat Bar-Hillel, A., Weinshall, D.: Learning a mahalanobis metric from equivalence constraints. J. Mach. Learn. Res. 6, 937–965 (2005) MathSciNetMATH Bar-Hillel, A., Weinshall, D.: Learning a mahalanobis metric from equivalence constraints. J. Mach. Learn. Res. 6, 937–965 (2005) MathSciNetMATH
2.
Zurück zum Zitat Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng Y.: Nus-wide: a real-world web image database from national university of singapore. In: ACM Conference on Image and Video Retrieval (2009) Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng Y.: Nus-wide: a real-world web image database from national university of singapore. In: ACM Conference on Image and Video Retrieval (2009)
3.
Zurück zum Zitat Cox, T., Cox, M.: Multidimensional scaling. Chapman and Hall, London (2001)MATH Cox, T., Cox, M.: Multidimensional scaling. Chapman and Hall, London (2001)MATH
4.
Zurück zum Zitat Davis, J.V., Kulis, B., Jain, P., Sra, S., Dhillon, I.S.: Information-theoretic metric learning. In: International Conference on Machine Learning, pp. 209–216 (2007) Davis, J.V., Kulis, B., Jain, P., Sra, S., Dhillon, I.S.: Information-theoretic metric learning. In: International Conference on Machine Learning, pp. 209–216 (2007)
5.
Zurück zum Zitat Dhillon, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: ACM Conference on Knowledge Discovery and Data Mining (2001) Dhillon, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: ACM Conference on Knowledge Discovery and Data Mining (2001)
6.
Zurück zum Zitat Fukunaga, K.: Introduction to Statistical Pattern Recognition. Elsevier, Amsterdam (1990)MATH Fukunaga, K.: Introduction to Statistical Pattern Recognition. Elsevier, Amsterdam (1990)MATH
7.
Zurück zum Zitat Goldberger, J., Roweis, S., Hinton, G., Salakhutdinov, R.: Neighbourhood components analysis. In: Annual Conference on Neural Information Processing Systems NIPS (2005) Goldberger, J., Roweis, S., Hinton, G., Salakhutdinov, R.: Neighbourhood components analysis. In: Annual Conference on Neural Information Processing Systems NIPS (2005)
8.
Zurück zum Zitat Gu, Q., Zhou, J.: Co-clustering on manifolds. In: KDD (2009) Gu, Q., Zhou, J.: Co-clustering on manifolds. In: KDD (2009)
9.
Zurück zum Zitat Guan, N., Tao, D., Luo, Z., Shawe-Taylor, J.: Mahnmf: Manhattan non-negative matrix factorization. CoRR abs/1207.3438 (2012) Guan, N., Tao, D., Luo, Z., Shawe-Taylor, J.: Mahnmf: Manhattan non-negative matrix factorization. CoRR abs/1207.3438 (2012)
10.
Zurück zum Zitat Henrion, D., Malick, J.: Projection methods in convex optimization. LAAS-CNRS Research Report 10730 (2010) Henrion, D., Malick, J.: Projection methods in convex optimization. LAAS-CNRS Research Report 10730 (2010)
11.
Zurück zum Zitat Hoi, S.C.H., Liu, W., Lyu, M.R., Ma, W.Y.: Learning distance metrics with contextual constraints for image retrieval. In: CVPR (2006) Hoi, S.C.H., Liu, W., Lyu, M.R., Ma, W.Y.: Learning distance metrics with contextual constraints for image retrieval. In: CVPR (2006)
12.
Zurück zum Zitat Huang, K., Ying, Y., Campbell, C.: Gsml: A unified framework for sparse metric learning. In: International Conference on Data Mining, pp. 189–198 (2009) Huang, K., Ying, Y., Campbell, C.: Gsml: A unified framework for sparse metric learning. In: International Conference on Data Mining, pp. 189–198 (2009)
13.
Zurück zum Zitat Long, M., Wang, J., Ding, G., Shen, D., Yang, Q.: Transfer learning with graph co-regularization. In: AAAI (2012) Long, M., Wang, J., Ding, G., Shen, D., Yang, Q.: Transfer learning with graph co-regularization. In: AAAI (2012)
14.
Zurück zum Zitat Nie, F., Huang, H., Cai, X., Ding, C.H.Q.: Efficient and robust feature selection via joint l21-norms minimization. In: Annual Conference on Neural Information Processing Systems (2010) Nie, F., Huang, H., Cai, X., Ding, C.H.Q.: Efficient and robust feature selection via joint l21-norms minimization. In: Annual Conference on Neural Information Processing Systems (2010)
15.
Zurück zum Zitat Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10):1345–1359 (2010)CrossRef Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10):1345–1359 (2010)CrossRef
16.
Zurück zum Zitat Qi, G.J., Hua, X.S., Zhang, H.J.: Learning semantic distance from community-tagged media collection. In: ACM Multimedia (2009) Qi, G.J., Hua, X.S., Zhang, H.J.: Learning semantic distance from community-tagged media collection. In: ACM Multimedia (2009)
17.
Zurück zum Zitat Roweis, S., Saul, L.: Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500), 2323–2326 (2000) CrossRef Roweis, S., Saul, L.: Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500), 2323–2326 (2000) CrossRef
18.
Zurück zum Zitat Schultz, M., Joachims, T.: Learning a distance metric from relative comparisons. In: Advances in Neural Information Processing Systems (2003) Schultz, M., Joachims, T.: Learning a distance metric from relative comparisons. In: Advances in Neural Information Processing Systems (2003)
19.
Zurück zum Zitat Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1349–1380 (2000)CrossRef Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1349–1380 (2000)CrossRef
20.
Zurück zum Zitat Tenenbaum, J.B., de Silva, V., Langford, J.C.: A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)CrossRef Tenenbaum, J.B., de Silva, V., Langford, J.C.: A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)CrossRef
21.
Zurück zum Zitat Wang, C., Mahadevan, S.: Heterogeneous domain adaptation using manifold alignment. In: IJCAI (2011) Wang, C., Mahadevan, S.: Heterogeneous domain adaptation using manifold alignment. In: IJCAI (2011)
22.
Zurück zum Zitat Wang M., Yang K., Hua X.S., Zhang H.J.: Towards a relevant and diverse search of social images. IEEE Trans. Multimedia 12(8), 829–842 (2010)CrossRef Wang M., Yang K., Hua X.S., Zhang H.J.: Towards a relevant and diverse search of social images. IEEE Trans. Multimedia 12(8), 829–842 (2010)CrossRef
23.
Zurück zum Zitat Weinberger, K., Saul, L.: Distance metric learning for large margin nearest neighbor classification. J. Mach. Learn. Res. 10, 207–244 (2009)MATH Weinberger, K., Saul, L.: Distance metric learning for large margin nearest neighbor classification. J. Mach. Learn. Res. 10, 207–244 (2009)MATH
24.
Zurück zum Zitat Wu, L., Hoi, S.C., Jin, R., Zhu, J., Yu, N.: Distance metric learning from uncertain side information with application to automated photo tagging. In: ACM International Conference on Multimedia (2009) Wu, L., Hoi, S.C., Jin, R., Zhu, J., Yu, N.: Distance metric learning from uncertain side information with application to automated photo tagging. In: ACM International Conference on Multimedia (2009)
25.
Zurück zum Zitat Wu, P., Hoi, S.C., Zhao, P., He, Y.: Mining social images with distance metric learning for automated image tagging. In: ACM International Conference on Web Search and Data Mining (2011) Wu, P., Hoi, S.C., Zhao, P., He, Y.: Mining social images with distance metric learning for automated image tagging. In: ACM International Conference on Web Search and Data Mining (2011)
26.
Zurück zum Zitat Xing, E.P., Jordan, M.I., Karp, R.M., Russell, S.J.: Distance metric learning with application to clustering with side information. In: Advances in Neural Information Processing Systems, pp. 1489–1496 (2002) Xing, E.P., Jordan, M.I., Karp, R.M., Russell, S.J.: Distance metric learning with application to clustering with side information. In: Advances in Neural Information Processing Systems, pp. 1489–1496 (2002)
27.
Zurück zum Zitat Yang, L., Jin, R., Sukthankar, R., Liu, Y.: An efficient algorithm for local distance metric learning. In: AAAI (2006) Yang, L., Jin, R., Sukthankar, R., Liu, Y.: An efficient algorithm for local distance metric learning. In: AAAI (2006)
28.
Zurück zum Zitat Yang, Q., Chen, Y., Xue, G., W. Dai, Y.Y.: Heterogeneous transfer learning for image clustering via the social web. In: ACL/AFNLP (2009) Yang, Q., Chen, Y., Xue, G., W. Dai, Y.Y.: Heterogeneous transfer learning for image clustering via the social web. In: ACL/AFNLP (2009)
29.
Zurück zum Zitat Yang, Y., Shen, H.T., Ma, Z., Huang, Z., Zhou, X.: L21-norm regularized discriminative feature selection for unsupervised learning. In: International Joint Conference on Artificial Intelligence (2011) Yang, Y., Shen, H.T., Ma, Z., Huang, Z., Zhou, X.: L21-norm regularized discriminative feature selection for unsupervised learning. In: International Joint Conference on Artificial Intelligence (2011)
30.
Zurück zum Zitat Yu, J., Liu, D., D. Tao, H.S.S.: Complex object correspondence construction in two-dimensional animation. IEEE Trans. Image Process. 20(11), 3257–3269 (2011) Yu, J., Liu, D., D. Tao, H.S.S.: Complex object correspondence construction in two-dimensional animation. IEEE Trans. Image Process. 20(11), 3257–3269 (2011)
31.
Zurück zum Zitat Yu, J., Liu, D., Tao, D., Seah, H.S.: On combining multiple features for cartoon character retrieval and clip synthesis. IEEE Trans. Syst. Man Cybern. Part B 42(5), 1413–1427 (2012)CrossRef Yu, J., Liu, D., Tao, D., Seah, H.S.: On combining multiple features for cartoon character retrieval and clip synthesis. IEEE Trans. Syst. Man Cybern. Part B 42(5), 1413–1427 (2012)CrossRef
32.
Zurück zum Zitat Yu, J., Tao, D., Rui, Y., Cheng, J.: Pairwise constraints based multiview features fusion for scene classification. Pattern Recognit. 46(2), 483–496 (2013)CrossRefMATH Yu, J., Tao, D., Rui, Y., Cheng, J.: Pairwise constraints based multiview features fusion for scene classification. Pattern Recognit. 46(2), 483–496 (2013)CrossRefMATH
33.
Zurück zum Zitat Yu, J., Tao, D., Wang, M.: Adaptive hypergraph learning and its application in image classification. IEEE Trans. Image Process. 21(7), 3262–3272 (2012)MathSciNetCrossRef Yu, J., Tao, D., Wang, M.: Adaptive hypergraph learning and its application in image classification. IEEE Trans. Image Process. 21(7), 3262–3272 (2012)MathSciNetCrossRef
34.
Zurück zum Zitat Yu, J., Wang, M., Tao, D.: Semi-supervised multiview distance metric learning for cartoon synthesis. IEEE Trans. Image Process. 21(11), 4636–4648 (2012)MathSciNetCrossRef Yu, J., Wang, M., Tao, D.: Semi-supervised multiview distance metric learning for cartoon synthesis. IEEE Trans. Image Process. 21(11), 4636–4648 (2012)MathSciNetCrossRef
Metadaten
Titel
Sparse semantic metric learning for image retrieval
verfasst von
Jing Liu
Zechao Li
Hanqing Lu
Publikationsdatum
01.11.2014
Verlag
Springer Berlin Heidelberg
Erschienen in
Multimedia Systems / Ausgabe 6/2014
Print ISSN: 0942-4962
Elektronische ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-013-0308-2

Weitere Artikel der Ausgabe 6/2014

Multimedia Systems 6/2014 Zur Ausgabe

Neuer Inhalt