Skip to main content

2016 | OriginalPaper | Buchkapitel

Utilizing Sensor-Social Cues to Localize Objects-of-Interest in Outdoor UGVs

verfasst von : Yingjie Xia, Luming Zhang, Liqiang Nie, Wenjing Geng

Erschienen in: MultiMedia Modeling

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

A huge number of outdoor user-generated videos (UGVs) are recorded daily due to the popularity of mobile intelligent devices. Managing these videos is a tough challenge in multimedia field. In this paper, we tackle this problem by performing object-of-interest (OOI) recognition in UGVs to identify semantically important regions. By leveraging geo-sensor and social data, we propose a novel framework for OOI recognition in outdoor UGVs. Firstly, the OOI acquisition is conducted to obtain an OOI frame set from UGVs. Simultaneously, the classified object set recommendation is performed to obtain a candidate category name set from social networks. Afterward, a spatial pyramid representation is deployed to describe social objects from images and OOIs from UGVs, respectively. Finally, OOIs with their annotated names are labeled in UGVs. Extensive experiments in outdoor UGVs from both Nanjing and Singapore demonstrated the competitiveness of our approach.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zheng, Y.-T., Zhao, M., Song, Y., Adam, H.: Tour the world: building a web-scale landmark recognition engine. In: Proceedings of CVPR (2009) Zheng, Y.-T., Zhao, M., Song, Y., Adam, H.: Tour the world: building a web-scale landmark recognition engine. In: Proceedings of CVPR (2009)
2.
Zurück zum Zitat Hao, J., Wang, G., Seo, B., Zimmermann, R.: Point of interest detection and visual distance estimation for sensor-rich video. IEEE T-MM 16(7), 1929–1941 (2014) Hao, J., Wang, G., Seo, B., Zimmermann, R.: Point of interest detection and visual distance estimation for sensor-rich video. IEEE T-MM 16(7), 1929–1941 (2014)
3.
Zurück zum Zitat Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of CVPR (2006) Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of CVPR (2006)
4.
Zurück zum Zitat Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of CVPR (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of CVPR (2005)
5.
Zurück zum Zitat Felzenszwalb, P.F., Girshick, R.B., McAllester, D.A., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE T-PAMI 32(9), 1627–1645 (2010)CrossRef Felzenszwalb, P.F., Girshick, R.B., McAllester, D.A., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE T-PAMI 32(9), 1627–1645 (2010)CrossRef
6.
Zurück zum Zitat Yang, K., Wang, M., Hua, X.-S., Yan, S., Zhang, H.-J.: Assemble new object detector with few examples. IEEE T-IP 20(12), 3341–3349 (2011)MathSciNetCrossRef Yang, K., Wang, M., Hua, X.-S., Yan, S., Zhang, H.-J.: Assemble new object detector with few examples. IEEE T-IP 20(12), 3341–3349 (2011)MathSciNetCrossRef
7.
Zurück zum Zitat Wang, M., Hua, X.-S., Hong, R., Tang, J., Qi, G.-J., Song, Y.: Unified video annotation via multi-graph learning. IEEE T-CSVT 19(5), 733–746 (2009) Wang, M., Hua, X.-S., Hong, R., Tang, J., Qi, G.-J., Song, Y.: Unified video annotation via multi-graph learning. IEEE T-CSVT 19(5), 733–746 (2009)
8.
Zurück zum Zitat Harzallah, H., Jurie, F., Schmid, C.: Combining efficient object localization and image classification. In: Proceedings of ICCV (2009) Harzallah, H., Jurie, F., Schmid, C.: Combining efficient object localization and image classification. In: Proceedings of ICCV (2009)
9.
Zurück zum Zitat Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: Proceedings of ICCV (2009) Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: Proceedings of ICCV (2009)
10.
Zurück zum Zitat Cinbis, R.G., Verbeek, J.J., Schmid, C.: Segmentation driven object detection with fisher vectors. In: Proceedings of ICCV (2013) Cinbis, R.G., Verbeek, J.J., Schmid, C.: Segmentation driven object detection with fisher vectors. In: Proceedings of ICCV (2013)
11.
Zurück zum Zitat Kim, S., Park, S., Kim, M.: Central object extraction for object-based image retrieval. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, pp. 39–49. Springer, Heidelberg (2003) CrossRef Kim, S., Park, S., Kim, M.: Central object extraction for object-based image retrieval. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, pp. 39–49. Springer, Heidelberg (2003) CrossRef
12.
Zurück zum Zitat Zhang, D., Javed, O., Shah, M.: Video object segmentation through spatially accurate and temporally dense extraction of primary object regions. In: Proceedings of CIVR (2013) Zhang, D., Javed, O., Shah, M.: Video object segmentation through spatially accurate and temporally dense extraction of primary object regions. In: Proceedings of CIVR (2013)
13.
Zurück zum Zitat Jiang, H., Wang, J., Yuan, Z., Liu, T., Zheng, N.: Automatic salient object segmentation based on context and shape prior. In: Proceedings of BMVC (2011) Jiang, H., Wang, J., Yuan, Z., Liu, T., Zheng, N.: Automatic salient object segmentation based on context and shape prior. In: Proceedings of BMVC (2011)
14.
Zurück zum Zitat Khuwuthyakorn, P., Robles-Kelly, A., Zhou, J.: Object of interest detection by saliency learning. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part II. LNCS, vol. 6312, pp. 636–649. Springer, Heidelberg (2010) CrossRef Khuwuthyakorn, P., Robles-Kelly, A., Zhou, J.: Object of interest detection by saliency learning. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part II. LNCS, vol. 6312, pp. 636–649. Springer, Heidelberg (2010) CrossRef
15.
Zurück zum Zitat Margolin, R., Tal, A., Zelnik-Manor, L.: What makes a patch distinct? In: Proceedings of CVPR (2013) Margolin, R., Tal, A., Zelnik-Manor, L.: What makes a patch distinct? In: Proceedings of CVPR (2013)
16.
Zurück zum Zitat Rosin, P.L.: A simple method for detecting salient regions. Pattern Recogn. 42(11), 2363–2371 (2009)MATHCrossRef Rosin, P.L.: A simple method for detecting salient regions. Pattern Recogn. 42(11), 2363–2371 (2009)MATHCrossRef
17.
Zurück zum Zitat Jia, Y., Han, M.: Category-independent object-level saliency detection. In: Proceedings of ICCV (2013) Jia, Y., Han, M.: Category-independent object-level saliency detection. In: Proceedings of ICCV (2013)
18.
Zurück zum Zitat Jiang, P., Ling, H., Yu, J., Peng, J.: Salient region detection by UFO: uniqueness, focusness and objectness. In: Proceedings of ICCV (2013) Jiang, P., Ling, H., Yu, J., Peng, J.: Salient region detection by UFO: uniqueness, focusness and objectness. In: Proceedings of ICCV (2013)
19.
Zurück zum Zitat Navalpakkam, V., Itti, L.: Modeling the influence of task on attention. Vision. Res. 45(2), 205–231 (2005)CrossRef Navalpakkam, V., Itti, L.: Modeling the influence of task on attention. Vision. Res. 45(2), 205–231 (2005)CrossRef
20.
Zurück zum Zitat Borji, A.: Boosting bottom-up and top-down visual features for saliency estimation. In: Proceedings of CVPR (2012) Borji, A.: Boosting bottom-up and top-down visual features for saliency estimation. In: Proceedings of CVPR (2012)
21.
Zurück zum Zitat Bolch, G., Greiner, S., de Meer, H., Trivedi, K.S.: Queueing Networks and Markov Chains, 2nd edn. John Wiley, Hoboken (2006)MATHCrossRef Bolch, G., Greiner, S., de Meer, H., Trivedi, K.S.: Queueing Networks and Markov Chains, 2nd edn. John Wiley, Hoboken (2006)MATHCrossRef
22.
Zurück zum Zitat Kalal, Z., Mikolajczyk, K., Matas, J.: Tracking-learning-detection. IEEE T-PAMI 34(7), 1409–1422 (2012)CrossRef Kalal, Z., Mikolajczyk, K., Matas, J.: Tracking-learning-detection. IEEE T-PAMI 34(7), 1409–1422 (2012)CrossRef
23.
Zurück zum Zitat Zhang, L., Bian, W., Song, M., Tao, D., Liu, X.: Integrating local features into discriminative graphlets for scene classification. In: Lu, B.-L., Zhang, L., Kwok, J. (eds.) ICONIP 2011, Part III. LNCS, vol. 7064, pp. 657–666. Springer, Heidelberg (2011) CrossRef Zhang, L., Bian, W., Song, M., Tao, D., Liu, X.: Integrating local features into discriminative graphlets for scene classification. In: Lu, B.-L., Zhang, L., Kwok, J. (eds.) ICONIP 2011, Part III. LNCS, vol. 7064, pp. 657–666. Springer, Heidelberg (2011) CrossRef
24.
Zurück zum Zitat Zhang, L., Song, M., Sun, L., Liu, X., Wang, Y., Tao, D., Bu, J., Chen, C.: Spatial graphlet matching kernel for recognizing aerial image categories. In: ICPR (2012) Zhang, L., Song, M., Sun, L., Liu, X., Wang, Y., Tao, D., Bu, J., Chen, C.: Spatial graphlet matching kernel for recognizing aerial image categories. In: ICPR (2012)
25.
Zurück zum Zitat Zhang, L., Gao, Y., Zimmermann, R., Tian, Q., Li, X.: Fusion of multichannel local and global structural cues for photo aesthetics evaluation. IEEE T-IP 23(3), 1419–1429 (2014)MathSciNetCrossRef Zhang, L., Gao, Y., Zimmermann, R., Tian, Q., Li, X.: Fusion of multichannel local and global structural cues for photo aesthetics evaluation. IEEE T-IP 23(3), 1419–1429 (2014)MathSciNetCrossRef
26.
Zurück zum Zitat Zhang, L., Wang, M., Nie, L., Hong, L., Rui, Y., Tian, Q.: Retargeting semantically-rich photos. IEEE T-MM 17(9), 1538–1549 (2015) Zhang, L., Wang, M., Nie, L., Hong, L., Rui, Y., Tian, Q.: Retargeting semantically-rich photos. IEEE T-MM 17(9), 1538–1549 (2015)
27.
Zurück zum Zitat Zhang, L., Gao, Y., Hong, R., Hu, Y., Ji, R., Dai, Q.: Probabilistic skimlets fusion for summarizing multiple consumer landmark videos. IEEE T-MM 17(1), 40–49 (2015) Zhang, L., Gao, Y., Hong, R., Hu, Y., Ji, R., Dai, Q.: Probabilistic skimlets fusion for summarizing multiple consumer landmark videos. IEEE T-MM 17(1), 40–49 (2015)
28.
Zurück zum Zitat Ay, S.A., Zimmermann, R., Kim, S.H.: Viewable scene modeling for geospatial video search. In: ACM Multimedia (2008) Ay, S.A., Zimmermann, R., Kim, S.H.: Viewable scene modeling for geospatial video search. In: ACM Multimedia (2008)
29.
Zurück zum Zitat Zheng, Y.-T., Zha, Z.-J., Chua, T.-S.: Research and applications on georeferenced multimedia. Multimedia Tools Appl. 51(1), 77–98 (2011)CrossRef Zheng, Y.-T., Zha, Z.-J., Chua, T.-S.: Research and applications on georeferenced multimedia. Multimedia Tools Appl. 51(1), 77–98 (2011)CrossRef
30.
Zurück zum Zitat Rodden, K., Wood, K.R.: How do people manage their digital photographs? In: ACM SIGCHI (2003) Rodden, K., Wood, K.R.: How do people manage their digital photographs? In: ACM SIGCHI (2003)
31.
Zurück zum Zitat Kentaro, T., Logan, R., Roseway, A., Anandan, P.: Geographic location tags on digital images. In: ACM Multimedia (2003) Kentaro, T., Logan, R., Roseway, A., Anandan, P.: Geographic location tags on digital images. In: ACM Multimedia (2003)
32.
Zurück zum Zitat Föckler, P., Zeidler, T., Brombach, B., Bruns, E., Bimber, O.: PhoneGuide: museum guidance supported by on-device object recognition on mobile phones. In: Proceedings of Mobile and Ubiquitous Multimedia (2005) Föckler, P., Zeidler, T., Brombach, B., Bruns, E., Bimber, O.: PhoneGuide: museum guidance supported by on-device object recognition on mobile phones. In: Proceedings of Mobile and Ubiquitous Multimedia (2005)
33.
Zurück zum Zitat Gammeter, S., Gassmann, A., Bossard, L.: Server-side object recognition and client-side object tracking for mobile augmented reality. In: Proceedings of CVPR (2010) Gammeter, S., Gassmann, A., Bossard, L.: Server-side object recognition and client-side object tracking for mobile augmented reality. In: Proceedings of CVPR (2010)
34.
Zurück zum Zitat Wang, M., Gao, Y., Ke, L., Rui, Y.: View-based discriminative probabilistic modeling for 3D object retrieval and recognition. IEEE T-IP 22(4), 1395–1407 (2013)CrossRef Wang, M., Gao, Y., Ke, L., Rui, Y.: View-based discriminative probabilistic modeling for 3D object retrieval and recognition. IEEE T-IP 22(4), 1395–1407 (2013)CrossRef
Metadaten
Titel
Utilizing Sensor-Social Cues to Localize Objects-of-Interest in Outdoor UGVs
verfasst von
Yingjie Xia
Luming Zhang
Liqiang Nie
Wenjing Geng
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-27671-7_8

Neuer Inhalt