2016 | Original Paper | Book Chapter

Dealing with Ambiguous Queries in Multimodal Video Retrieval

Authors: Luca Rossetto, Claudiu Tănase, Heiko Schuldt

Published in: MultiMedia Modeling

Publisher: Springer International Publishing

Abstract

Dealing with ambiguous queries is an important challenge in information retrieval (IR). While this problem is well understood in text retrieval, this is not the case in video retrieval, especially when multimodal queries have to be considered, as for instance in Query-by-Example or Query-by-Sketch. Systems supporting such query types usually use dedicated features for the different modalities. These can be intrinsic object features such as color, edge, or texture for the visual modality, or motion for the kinesthetic modality. Sketch-based queries tend to be ambiguous because they leave some information channels unspecified. In this case, the IR system cannot deduce whether the missing information should be absent from the result or has simply not been specified, and it has to select the features to be considered accordingly. In this paper, we present an approach that deals with such ambiguous queries in sketch-based multimodal video retrieval. The approach anticipates the intent(s) of a user based on the information specified in a query and selects the features to be considered for query execution accordingly. We have evaluated our approach using Cineast, a sketch-based video retrieval system. The evaluation results show that disregarding certain features based on the anticipated query intent(s) can increase retrieval quality by more than 25% over a generic query execution strategy.
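
The idea of selecting features according to what a query actually specifies can be illustrated with a minimal sketch. This is not the method or API of Cineast; the feature names, the channel names, the `FEATURE_CHANNELS` registry, and the averaging fusion are all assumptions made for illustration only. A feature is used only if every information channel it depends on is present in the query, and the scores of the remaining features are averaged per video segment.

```python
from typing import Dict, FrozenSet, List

# Hypothetical registry (an assumption, not Cineast's feature set):
# feature name -> query channels the feature needs in order to be meaningful.
FEATURE_CHANNELS: Dict[str, FrozenSet[str]] = {
    "average_color": frozenset({"color"}),
    "edge_histogram": frozenset({"edge"}),
    "motion_histogram": frozenset({"motion"}),
    "color_edge_layout": frozenset({"color", "edge"}),
}


def select_features(specified: FrozenSet[str]) -> List[str]:
    """Keep only features whose required channels are all specified in the query."""
    return sorted(
        name for name, needed in FEATURE_CHANNELS.items() if needed <= specified
    )


def fuse_scores(
    per_feature: Dict[str, Dict[str, float]], active: List[str]
) -> Dict[str, float]:
    """Average the similarity scores per segment over the active features only."""
    fused: Dict[str, float] = {}
    for feature in active:
        for segment, score in per_feature.get(feature, {}).items():
            fused[segment] = fused.get(segment, 0.0) + score / len(active)
    return fused


# Illustrative scores for two segments; the values are made up.
scores = {
    "average_color": {"s1": 0.8, "s2": 0.4},
    "motion_histogram": {"s1": 0.0, "s2": 1.0},
}

# A sketch that contains color but no motion: motion features are disregarded
# instead of being matched against an empty motion specification.
active = select_features(frozenset({"color"}))
ranking = fuse_scores(scores, active)
```

In this toy setting, a color-only sketch ranks `s1` above `s2`, whereas a generic strategy that always includes the motion feature would rank `s2` first because of a channel the user never specified.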

Metadata
Title
Dealing with Ambiguous Queries in Multimodal Video Retrieval
Authors
Luca Rossetto
Claudiu Tănase
Heiko Schuldt
Copyright year
2016
DOI
https://doi.org/10.1007/978-3-319-27671-7_75