Skip to main content
Top

2018 | OriginalPaper | Chapter

A Deep Autoencoder-Based Knowledge Transfer Approach

Author : Sreenivas Sremath Tirumala

Published in: Proceedings of International Conference on Computational Intelligence and Data Engineering

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Deep Transfer Learning or DTS has proven successful with deep neural networks and deep belief networks. However, there has been limited research on to using deep autoencoder (DAE)-based network to implement DTS. This paper for the first time attempts to identify transferable features in the form of learning and transfer them to another network implementing a simple DTS mechanism. In this paper, a transfer of knowledge process is proposed where in knowledge is transferred from one Deep autoencoder network to another. This knowledge transfer has helped to improve the classification accuracy of the receiving autoencoder, particularly when experimented using corrupted dataset. The experiments are carried out on a texa based hierarchical dataset. Firstly, a DAE is trained with regular undamaged dataset to achieve maximum accuracy. Then, a distorted dataset was used to train second DAEN for classification with which only 56.7% of the data is correctly classified. Then a set of weights are transferred from from first DAEN to the second DAEN which resulted in an an improvement of classification accuracy by about 22%. The key contribution of this paper is highlighting importance of knowledge transfer between two deep autoencoder networks which is proposed for the first time.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference D. K. Milligan and M. J. D. Wilson, Fundamental Structure/Behaviour Relationships in Synchronous Boolean Neural Networks, 1990, pp. 997–1000. D. K. Milligan and M. J. D. Wilson, Fundamental Structure/Behaviour Relationships in Synchronous Boolean Neural Networks, 1990, pp. 997–1000.
2.
go back to reference Z. Waszczyszyn, Fundamentals of artificial neural networks. Springer, 1999, pp. 1–51. Z. Waszczyszyn, Fundamentals of artificial neural networks. Springer, 1999, pp. 1–51.
3.
go back to reference A. V. Terekhov, G. Montone, and J. K. OâĂŹRegan, Knowledge Transfer in Deep Block-Modular Neural Networks, 2015, pp. 268–279. A. V. Terekhov, G. Montone, and J. K. OâĂŹRegan, Knowledge Transfer in Deep Block-Modular Neural Networks, 2015, pp. 268–279.
4.
go back to reference Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, “Greedy layer-wise training of deep networks,” in Advances in Neural Information Processing Systems 19. MIT Press, 2007, pp. 153–160. Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, “Greedy layer-wise training of deep networks,” in Advances in Neural Information Processing Systems 19. MIT Press, 2007, pp. 153–160.
5.
go back to reference Y. Bengio and Y. LeCun, “Scaling learning algorithms towards AI,” in Large Scale Kernel Machines. MIT Press, 2007. Y. Bengio and Y. LeCun, “Scaling learning algorithms towards AI,” in Large Scale Kernel Machines. MIT Press, 2007.
6.
go back to reference K. He, X. Zhang, S. Ren, and J. Sun, “Spatial pyramid pooling in deep convolutional networks for visual recognition,” Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. PP, no. 99, pp. 1–1, 2015. K. He, X. Zhang, S. Ren, and J. Sun, “Spatial pyramid pooling in deep convolutional networks for visual recognition,” Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. PP, no. 99, pp. 1–1, 2015.
7.
go back to reference C. Xiong, L. Liu, X. Zhao, S. Yan, and T. Kim, “Convolutional fusion network for face verification in the wild,” Circuits and Systems for Video Technology, IEEE Transactions on, vol. PP, no. 99, pp. 1–1, 2015. C. Xiong, L. Liu, X. Zhao, S. Yan, and T. Kim, “Convolutional fusion network for face verification in the wild,” Circuits and Systems for Video Technology, IEEE Transactions on, vol. PP, no. 99, pp. 1–1, 2015.
8.
go back to reference D. Hingu, D. Shah, and S. S. Udmale, “Automatic text summarization of wikipedia articles,” in Communication, Information Computing Technology (ICCICT), 2015 International Conference on, Jan 2015, pp. 1–4. D. Hingu, D. Shah, and S. S. Udmale, “Automatic text summarization of wikipedia articles,” in Communication, Information Computing Technology (ICCICT), 2015 International Conference on, Jan 2015, pp. 1–4.
9.
go back to reference A. Graves and J. Schmidhuber, “Offline handwriting recognition with multidimensional recurrent neural networks,” pp. 545–552, 2009. A. Graves and J. Schmidhuber, “Offline handwriting recognition with multidimensional recurrent neural networks,” pp. 545–552, 2009.
10.
go back to reference S. S. Tirumala, “Implementation of evolutionary algorithms for deep architectures,” in Proceedings of the 2nd International Workshop on Artificial Intelligence and Cognition (AIC), Torino, Italy, November, 2014, pp. 164–171. S. S. Tirumala, “Implementation of evolutionary algorithms for deep architectures,” in Proceedings of the 2nd International Workshop on Artificial Intelligence and Cognition (AIC), Torino, Italy, November, 2014, pp. 164–171.
11.
go back to reference S. S. Tirumala and A. Narayanan, “Hierarchical data classification using deep neural networks,” in Neural Information Processing. Springer International Publishing, 2015, pp. 492–500. S. S. Tirumala and A. Narayanan, “Hierarchical data classification using deep neural networks,” in Neural Information Processing. Springer International Publishing, 2015, pp. 492–500.
12.
go back to reference E. Y. Li, “Artificial neural networks and their business applications,” Information & Management, vol. 27, no. 5, pp. 303–313, 1994. E. Y. Li, “Artificial neural networks and their business applications,” Information & Management, vol. 27, no. 5, pp. 303–313, 1994.
13.
go back to reference Y. Bengio, A. Courville, and P. Vincent, “Representation learning: A review and new perspectives,” Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 35, no. 8, pp. 1798–1828, 2013. Y. Bengio, A. Courville, and P. Vincent, “Representation learning: A review and new perspectives,” Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 35, no. 8, pp. 1798–1828, 2013.
14.
go back to reference J. Yosinski, J. Clune, Y. Bengio, and H. Lipson, “How transferable are features in deep neural networks?” in Advances in Neural Information Processing Systems, 2014, pp. 3320–3328. J. Yosinski, J. Clune, Y. Bengio, and H. Lipson, “How transferable are features in deep neural networks?” in Advances in Neural Information Processing Systems, 2014, pp. 3320–3328.
15.
go back to reference S. Gutstein, O. Fuentes, and E. Freudenthal, “Knowledge transfer in deep convolutional neural nets,” International Journal on Artificial Intelligence Tools, vol. 17, no. 03, pp. 555–567, 2008. S. Gutstein, O. Fuentes, and E. Freudenthal, “Knowledge transfer in deep convolutional neural nets,” International Journal on Artificial Intelligence Tools, vol. 17, no. 03, pp. 555–567, 2008.
16.
go back to reference D. C. Cireşan, U. Meier, and J. Schmidhuber, “Transfer learning for latin and chinese characters with deep neural networks,” in Neural Networks (IJCNN), The 2012 International Joint Conference on. IEEE, 2012, pp. 1–6. D. C. Cireşan, U. Meier, and J. Schmidhuber, “Transfer learning for latin and chinese characters with deep neural networks,” in Neural Networks (IJCNN), The 2012 International Joint Conference on. IEEE, 2012, pp. 1–6.
17.
go back to reference C. Kandaswamy, L. M. Silva, L. A. Alexandre, J. M. Santos, and J. M. Sá, Artificial Neural Networks and Machine Learning - ICANN 2014: 24th International Conference on Artificial Neural Networks, Hamburg, Germany, September 15–19, 2014. Proceedings. Cham: Springer International Publishing, 2014, ch. Improving Deep Neural Network Performance by Reusing Features Trained with Transductive Transference, pp. 265–272. C. Kandaswamy, L. M. Silva, L. A. Alexandre, J. M. Santos, and J. M. Sá, Artificial Neural Networks and Machine Learning - ICANN 2014: 24th International Conference on Artificial Neural Networks, Hamburg, Germany, September 15–19, 2014. Proceedings. Cham: Springer International Publishing, 2014, ch. Improving Deep Neural Network Performance by Reusing Features Trained with Transductive Transference, pp. 265–272.
18.
go back to reference M. Long, Y. Cao, J. Wang, and M. Jordan, “Learning transferable features with deep adaptation networks,” in Proceedings of the 32nd International Conference on Machine Learning (ICML-15), D. Blei and F. Bach, Eds. JMLR Workshop and Conference Proceedings, 2015, pp. 97–105. [Online]. Available: http://jmlr.org/proceedings/papers/v37/long15.pdf. M. Long, Y. Cao, J. Wang, and M. Jordan, “Learning transferable features with deep adaptation networks,” in Proceedings of the 32nd International Conference on Machine Learning (ICML-15), D. Blei and F. Bach, Eds. JMLR Workshop and Conference Proceedings, 2015, pp. 97–105. [Online]. Available: http://​jmlr.​org/​proceedings/​papers/​v37/​long15.​pdf.
Metadata
Title
A Deep Autoencoder-Based Knowledge Transfer Approach
Author
Sreenivas Sremath Tirumala
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-6319-0_23

Premium Partner