Skip to main content

2016 | OriginalPaper | Buchkapitel

OGB: A Distinctive and Efficient Feature for Mobile Augmented Reality

verfasst von : Xin Yang, Xinggang Wang, Kwang-Ting (Tim) Cheng

Erschienen in: MultiMedia Modeling

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The distinctiveness and efficiency of a feature descriptor used for object recognition and tracking are fundamental to the user experience of a mobile augmented reality (MAR) system. However, existing descriptors are either too compute-expensive to achieve real-time performance on a mobile device, or not sufficiently distinctive to identify correct matches from a large database. As a result, current MAR systems are still limited in both functionalities and capabilities, which greatly restrict their deployment in practice. In this paper, we propose a highly distinctive and efficient binary descriptor, called Oriented Gradients Binary (OGB). OGB captures the major edge/gradient structure that is an important characteristic of local shapes and appearance. Specifically, OGB computes the distribution of major edge/gradient directions within an image patch. To achieve high efficiency, aggressive down-sampling is applied to the patch to significantly reduce the computational complexity, while maintaining major edge/gradient directions within the patch. Comparing to the state-of-the-art binary descriptors including ORB, BRISK and FREAK, which are primarily designed for speed, OGB has similar construction efficiency, while achieves a superior performance for both object recognition and tracking tasks running on a mobile handheld device.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Compu. Vision 60(2), 91–110 (2004)CrossRef Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Compu. Vision 60(2), 91–110 (2004)CrossRef
2.
Zurück zum Zitat Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of CVPR 2005 (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of CVPR 2005 (2005)
3.
Zurück zum Zitat Calonder, M., Lepetit, V., Strecha, C., Fua, P.: Brief: binary robust independent elementary features. In: Proceedings of ECCV 2010 (2010) Calonder, M., Lepetit, V., Strecha, C., Fua, P.: Brief: binary robust independent elementary features. In: Proceedings of ECCV 2010 (2010)
4.
Zurück zum Zitat Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Proceedings of ICCV 2011, Barcelona, Spain (2011) Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Proceedings of ICCV 2011, Barcelona, Spain (2011)
5.
Zurück zum Zitat Leutenegger, S., Chli, M., Siegwart, R.: BRISK: binary robust invariant scalable keypoints. In: Proceedings of CVPR 2011 (2011) Leutenegger, S., Chli, M., Siegwart, R.: BRISK: binary robust invariant scalable keypoints. In: Proceedings of CVPR 2011 (2011)
6.
Zurück zum Zitat Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: fast retinal keypoint. In: Proceedings of CVPR 2012 (2012) Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: fast retinal keypoint. In: Proceedings of CVPR 2012 (2012)
7.
Zurück zum Zitat Salakhutdinov, R., Hinton, G.: Semantic hashing. Int. J. Approximate Reasoning, 3 (2009) Salakhutdinov, R., Hinton, G.: Semantic hashing. Int. J. Approximate Reasoning, 3 (2009)
8.
Zurück zum Zitat Weiss, Y., Fergus, R., Torralba, A.: Spectral hashing. In: Proceedings of NIPS 2009, pp: 1753–1760 (2009) Weiss, Y., Fergus, R., Torralba, A.: Spectral hashing. In: Proceedings of NIPS 2009, pp: 1753–1760 (2009)
9.
Zurück zum Zitat Gong, Y., Lazebnik, S., Gordo, A., Perronnin, F.: Iterative quantization: a procrustean approach to learning binanry codes for large-scale image retrieval. IEEE Trans. PAMI (2012) Gong, Y., Lazebnik, S., Gordo, A., Perronnin, F.: Iterative quantization: a procrustean approach to learning binanry codes for large-scale image retrieval. IEEE Trans. PAMI (2012)
10.
Zurück zum Zitat Wang, J., Kumar, S., Chang, S.-F. Sequential projection learning for hashing with compact codes. In: Proceedings of ICML 2010 (2010) Wang, J., Kumar, S., Chang, S.-F. Sequential projection learning for hashing with compact codes. In: Proceedings of ICML 2010 (2010)
11.
Zurück zum Zitat Wagner, D., Reitmayr, G., Mulloni, A., Drummond, T., Schmalstieg, D.: Pose tracking from natural features on mobile phones. In: Proceedings of ISMAR 2008 (2008) Wagner, D., Reitmayr, G., Mulloni, A., Drummond, T., Schmalstieg, D.: Pose tracking from natural features on mobile phones. In: Proceedings of ISMAR 2008 (2008)
12.
Zurück zum Zitat Wagner, D., Schmalstieg, D., Bischof, H.: Multiple target detection and tracking with guaranteed framerates on mobile phones. In: Proceedings of ISMAR 2009 (2009) Wagner, D., Schmalstieg, D., Bischof, H.: Multiple target detection and tracking with guaranteed framerates on mobile phones. In: Proceedings of ISMAR 2009 (2009)
13.
Zurück zum Zitat Wagner, D., Mulloni, A., Langlotz, T., Schmalstieg, D.: Real-time panoramic mapping and tracking on mobile phones. In: Proceedings of IEEE VR 2010 (2010) Wagner, D., Mulloni, A., Langlotz, T., Schmalstieg, D.: Real-time panoramic mapping and tracking on mobile phones. In: Proceedings of IEEE VR 2010 (2010)
14.
Zurück zum Zitat Klein, G., Murray, D.: Parallel tracking and mapping on a camera phone. In: Proceedings of ISMAR 2009, Orlando (October 2009) Klein, G., Murray, D.: Parallel tracking and mapping on a camera phone. In: Proceedings of ISMAR 2009, Orlando (October 2009)
15.
Zurück zum Zitat Parker, J., Kenyon, R., Troxel, D.: Comparison of interpolating methods for image resampling. IEEE Trans. Med. Imaging 2(1), 31–39 (1983)CrossRef Parker, J., Kenyon, R., Troxel, D.: Comparison of interpolating methods for image resampling. IEEE Trans. Med. Imaging 2(1), 31–39 (1983)CrossRef
16.
Zurück zum Zitat Rosten, E., Porter, R., Drummond, T.: Faster and better: a machine learning approach to corner detection. IEEE Trans. PAMI 32, 105–119 (2010)CrossRef Rosten, E., Porter, R., Drummond, T.: Faster and better: a machine learning approach to corner detection. IEEE Trans. PAMI 32, 105–119 (2010)CrossRef
19.
Zurück zum Zitat Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL visual object classes challenge (2009) Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL visual object classes challenge (2009)
20.
Zurück zum Zitat Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Comm. ACM 24(6), 381–395 (1981)MathSciNetCrossRef Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Comm. ACM 24(6), 381–395 (1981)MathSciNetCrossRef
21.
Zurück zum Zitat Chum, O., Matas, J.: Matching with PROSAC – progressive sample consensus. In: Proceedings of CVPR 2005, vol. 1, pp. 220–226 (2005) Chum, O., Matas, J.: Matching with PROSAC – progressive sample consensus. In: Proceedings of CVPR 2005, vol. 1, pp. 220–226 (2005)
22.
Zurück zum Zitat Gionis, A., Indyk, P., Motwani, R.: Similarity search in high dimensions via hashing. In: Proceedings of VLDB (1999) Gionis, A., Indyk, P., Motwani, R.: Similarity search in high dimensions via hashing. In: Proceedings of VLDB (1999)
23.
Zurück zum Zitat Lv, Q., Josephson, W., Wang, Z., Charikar, M., Li, K.: Multi-probe LSH: efficient indexing for high-dimensional similarity search. In: Proceedings of VLDB (2007) Lv, Q., Josephson, W., Wang, Z., Charikar, M., Li, K.: Multi-probe LSH: efficient indexing for high-dimensional similarity search. In: Proceedings of VLDB (2007)
24.
Zurück zum Zitat Hong, R.C., Tang, L.X., Hu, J., Li, G.D., Jiang, J.G.: Advertising object in web videos. Neurocomputing 119, 118–124 (2013)CrossRef Hong, R.C., Tang, L.X., Hu, J., Li, G.D., Jiang, J.G.: Advertising object in web videos. Neurocomputing 119, 118–124 (2013)CrossRef
25.
Zurück zum Zitat Wang, M., Li, G.D., Lu, Z., Gao, Y., Chua, T.-S.: When amazon meets google: product visualization by exploring multiple information sources. ACM Trans. Internet Technol. 12(4), Article 2 (2013)CrossRef Wang, M., Li, G.D., Lu, Z., Gao, Y., Chua, T.-S.: When amazon meets google: product visualization by exploring multiple information sources. ACM Trans. Internet Technol. 12(4), Article 2 (2013)CrossRef
26.
Zurück zum Zitat Wang, M., Li, H., Tao, D.C., Lu, K., Wu, X.D.: Multimodal graph-based reranking for web image search. IEEE Trans. Image Process. 21(11), 4649–4661 (2012)MathSciNetCrossRef Wang, M., Li, H., Tao, D.C., Lu, K., Wu, X.D.: Multimodal graph-based reranking for web image search. IEEE Trans. Image Process. 21(11), 4649–4661 (2012)MathSciNetCrossRef
Metadaten
Titel
OGB: A Distinctive and Efficient Feature for Mobile Augmented Reality
verfasst von
Xin Yang
Xinggang Wang
Kwang-Ting (Tim) Cheng
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-27671-7_40

Neuer Inhalt