2017 | OriginalPaper | Chapter

Graspable Object Classification with Multi-loss Hierarchical Representations

Authors: Zhichao Wang, Zhiqi Li, Bin Wang, Hong Liu

Published in: Intelligent Robotics and Applications

Publisher: Springer International Publishing

Abstract

To accomplish manipulation tasks effectively, robots must precisely and robustly recognize which objects are graspable and which category each graspable object belongs to, especially when training data are limited. In this paper, we propose a novel multi-loss hierarchical representation learning framework that recognizes the category of graspable objects in a coarse-to-fine way. Our model consists of two main components: an efficient hierarchical feature learning component that combines kernel features with deep learning features, and a multi-loss function that optimizes the multi-task learning mechanism in a coarse-to-fine way. We evaluate the proposed system on data of graspable and ungraspable objects. The results show that our system outperforms many existing algorithms in both classification accuracy and computational efficiency. Moreover, it achieves an accuracy of about 82% in unstructured real-world conditions.
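
The abstract does not give the form of the multi-loss function, so the following is only an illustrative sketch of one common way to set up a coarse-to-fine objective, not the authors' formulation. It assumes that kernel and deep features are fused by simple concatenation, that the coarse task is graspable versus ungraspable, that the fine task is the object category, and that the two cross-entropy terms are mixed by a hypothetical weight `alpha`. All names (`multi_loss`, `w_coarse`, `w_fine`, `alpha`) are made up for illustration.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)


def cross_entropy(logits, labels):
    """Mean cross-entropy of integer labels under softmax(logits)."""
    probs = softmax(logits)
    picked = probs[np.arange(len(labels)), labels]
    return float(-np.log(picked + 1e-12).mean())


def multi_loss(features, w_coarse, w_fine, y_coarse, y_fine, alpha=0.5):
    """Weighted sum of a coarse and a fine classification loss.

    Assumption: two linear heads share one fused feature vector.
    The coarse head predicts graspable vs. ungraspable, the fine head
    predicts the object category; alpha trades off the two terms.
    """
    loss_coarse = cross_entropy(features @ w_coarse, y_coarse)
    loss_fine = cross_entropy(features @ w_fine, y_fine)
    total = alpha * loss_coarse + (1.0 - alpha) * loss_fine
    return total, loss_coarse, loss_fine


# Hypothetical toy data: 4 samples, 3 kernel features and 5 deep features.
rng = np.random.default_rng(0)
kernel_feats = rng.normal(size=(4, 3))
deep_feats = rng.normal(size=(4, 5))
# Assumed fusion step: plain concatenation of the two feature types.
features = np.concatenate([kernel_feats, deep_feats], axis=1)

w_coarse = rng.normal(size=(8, 2)) * 0.1   # 2 coarse classes
w_fine = rng.normal(size=(8, 3)) * 0.1     # 3 fine categories (made up)
y_coarse = np.array([0, 1, 1, 0])
y_fine = np.array([0, 2, 1, 0])

total, loss_coarse, loss_fine = multi_loss(
    features, w_coarse, w_fine, y_coarse, y_fine, alpha=0.5
)
print(total, loss_coarse, loss_fine)
```

In this sketch, a larger `alpha` emphasizes the coarse graspable/ungraspable decision and a smaller one emphasizes the fine category. How the paper actually balances or schedules its losses is not stated in the abstract.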

Metadata
Title
Graspable Object Classification with Multi-loss Hierarchical Representations
Authors
Zhichao Wang
Zhiqi Li
Bin Wang
Hong Liu
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-65298-6_42
