2018 | Original Paper | Book chapter

More Discriminative CNN with Inter Loss for Classification

Authors: Jianchao Fei, Ting Rui, Xiaona Song, You Zhou, Sai Zhang

Published in: Artificial Intelligence and Robotics

Publisher: Springer International Publishing

Abstract

In recent years, convolutional neural networks (CNNs) have become a hot spot in areas such as object detection and classification. With deeper study of CNNs, their performance has become almost human-competitive. We find that test accuracy largely depends on the relationship between samples in feature space. Softmax loss is widely used in many deep learning algorithms; however, it cannot directly reflect this kind of relationship. In this paper, we design a new loss function, named inter loss. This inter loss function maximizes the distance between different classes, analogous to maximizing the margin in an SVM. By integrating inter loss and softmax loss, a larger inter-class distance and a smaller intra-class distance can be obtained. In this way, we can significantly improve classification accuracy. Impressive results are obtained on the SVHN and CIFAR-10 datasets. However, our main goal is to introduce a novel loss function rather than to beat the state of the art. In our experiments, other forms of loss functions based on inter-class and intra-class distance are also considered, to demonstrate the effectiveness of inter loss.
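
The abstract does not state the formula for inter loss, so the following is only a minimal sketch of the general idea of adding an inter-class distance term to softmax loss. Everything specific here is an assumption made for illustration and is not taken from the paper: the hinge-style penalty on the distance between class centers, the names `inter_term` and `combined_loss`, and the parameters `margin` and `lam`.

```python
import math
from itertools import combinations


def softmax_cross_entropy(logits, label):
    """Softmax loss for one sample: -log softmax(logits)[label]."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


def class_centers(features, labels):
    """Mean feature vector of each class, returned as {label: center}."""
    sums, counts = {}, {}
    for x, y in zip(features, labels):
        if y not in sums:
            sums[y] = [0.0] * len(x)
            counts[y] = 0
        sums[y] = [s + v for s, v in zip(sums[y], x)]
        counts[y] += 1
    return {y: [s / counts[y] for s in sums[y]] for y in sums}


def inter_term(features, labels, margin=4.0):
    """Hypothetical inter-class penalty (an assumption, not the paper's formula).

    Averages max(0, margin - distance) over all pairs of class centers,
    so the penalty shrinks as class centers move apart and is zero once
    every pair is at least `margin` apart.
    """
    centers = class_centers(features, labels)
    pairs = list(combinations(sorted(centers), 2))
    if not pairs:
        return 0.0
    total = 0.0
    for a, b in pairs:
        dist = math.sqrt(sum((u - v) ** 2 for u, v in zip(centers[a], centers[b])))
        total += max(0.0, margin - dist)
    return total / len(pairs)


def combined_loss(logits_batch, features, labels, lam=0.1, margin=4.0):
    """Mean softmax loss plus `lam` times the hypothetical inter-class penalty."""
    ce = sum(softmax_cross_entropy(z, y) for z, y in zip(logits_batch, labels))
    return ce / len(labels) + lam * inter_term(features, labels, margin)


# Toy 2-D features for two classes: one layout with close centers, one with distant centers.
labels = [0, 0, 1, 1]
close = [[0.0, 0.0], [0.2, 0.0], [1.0, 0.0], [1.2, 0.0]]
far = [[0.0, 0.0], [0.2, 0.0], [5.0, 0.0], [5.2, 0.0]]
logits = [[0.0, 0.0]] * 4  # uninformative logits, so the softmax part is identical for both layouts
print(combined_loss(logits, close, labels) > combined_loss(logits, far, labels))
```

With identical logits, the combined value is higher for the layout whose class centers are close together, which is the kind of feature-space relationship that softmax loss alone does not measure.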

Metadata
Title
More Discriminative CNN with Inter Loss for Classification
Authors
Jianchao Fei
Ting Rui
Xiaona Song
You Zhou
Sai Zhang
Copyright year
2018
DOI
https://doi.org/10.1007/978-3-319-69877-9_26