Skip to main content

2022 | OriginalPaper | Buchkapitel

Deep Learning on Small Tabular Dataset: Using Transfer Learning and Image Classification

verfasst von : Vanshika Jain, Meghansh Goel, Kshitiz Shah

Erschienen in: Artificial Intelligence and Speech Technology

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Deep Learning is a subset of machine learning inspired by the human brain. It uses multiple layers of representation to extract specific knowledge from raw input. It is best suited for large image or sound-based datasets. Deep learning methods are generally avoided for small datasets because they tend to overfit. Transfer learning can be one approach used to solve this problem. However, in the case of tabular datasets, their heterogeneous nature makes transfer learning algorithms inapplicable. This paper aims to discuss a few approaches using a literature review to convert tabular data into images to overcome such limitations. The paper provides a 2-part study wherein we first give a brief overview of transfer learning enhancing the efficiency of deep learning algorithms and drastically reducing the training time for small datasets. Secondly, we provide a detailed study of different techniques available to convert tabular data into images for image classification such as SuperTML, IGTD, and REFINED approach. Furthermore, we propose a novel approach inspired by IGTD to create a blocked image representation of the tabular data on which we apply transfer learning to demonstrate the application of deep learning methods on small tabular datasets (with less than 1000 data points).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhang, D., et al.: The AI Index 2021 Annual Report (2021) Zhang, D., et al.: The AI Index 2021 Annual Report (2021)
4.
Zurück zum Zitat Sun, B., et al.: SuperTML: Two-Dimensional Word Embedding and Transfer Learning Using ImageNet Pretrained CNN Models for the Classifications on Tabular Data (2019) Sun, B., et al.: SuperTML: Two-Dimensional Word Embedding and Transfer Learning Using ImageNet Pretrained CNN Models for the Classifications on Tabular Data (2019)
6.
Zurück zum Zitat Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., Liu, C.: A Survey on Deep Transfer Learning. arXiv (2018) Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., Liu, C.: A Survey on Deep Transfer Learning. arXiv (2018)
7.
Zurück zum Zitat Duan, L., Xu, D., Tsang, I.W.: Learning with Augmented Features for Heterogeneous Domain Adaptation. arXiv (2012) Duan, L., Xu, D., Tsang, I.W.: Learning with Augmented Features for Heterogeneous Domain Adaptation. arXiv (2012)
8.
Zurück zum Zitat Kulis, B., Saenko, K., Darrell, T.: What you saw is not what you get: domain adaptation using asymmetric kernel transforms. CVPR 2011, 1785–1792 (2011) Kulis, B., Saenko, K., Darrell, T.: What you saw is not what you get: domain adaptation using asymmetric kernel transforms. CVPR 2011, 1785–1792 (2011)
9.
Zurück zum Zitat Zhu, Y., et al.: Heterogeneous transfer learning for image classification. In: Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, AAAI (2011) Zhu, Y., et al.: Heterogeneous transfer learning for image classification. In: Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, AAAI (2011)
10.
Zurück zum Zitat Tianyi Zhou, J., Tsang, I.W., Jialin Pan, S., Tan, M.: Heterogeneous domain adaptation for multiple classes. In: AISTATS (International Conference on Artificial Intelligence and Statistics) (2014) Tianyi Zhou, J., Tsang, I.W., Jialin Pan, S., Tan, M.: Heterogeneous domain adaptation for multiple classes. In: AISTATS (International Conference on Artificial Intelligence and Statistics) (2014)
11.
Zurück zum Zitat Zhou, J.T., Pan, S.J., Tsang, I.W., Yan, Y.: Hybrid heterogeneous transfer learning through deep learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 28, no. 1 (2014) Zhou, J.T., Pan, S.J., Tsang, I.W., Yan, Y.: Hybrid heterogeneous transfer learning through deep learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 28, no. 1 (2014)
12.
Zurück zum Zitat Prettenhofer, P., Stein, B.: Cross-language text classification using structural correspondence learning. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 11–16 (2010) Prettenhofer, P., Stein, B.: Cross-language text classification using structural correspondence learning. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 11–16 (2010)
13.
Zurück zum Zitat Nam, J., Kim, S.: Heterogeneous defect prediction. IEEE Trans. Softw. Eng. 44(9), 874–896 (2015)CrossRef Nam, J., Kim, S.: Heterogeneous defect prediction. IEEE Trans. Softw. Eng. 44(9), 874–896 (2015)CrossRef
14.
Zurück zum Zitat Wang, C., Mahadevan, S.: Heterogeneous domain adaptation using manifold alignment. In: Proceedings of the 22nd International Joint Conference on Artificial Intelligence, pp. 1541–1546 (2011) Wang, C., Mahadevan, S.: Heterogeneous domain adaptation using manifold alignment. In: Proceedings of the 22nd International Joint Conference on Artificial Intelligence, pp. 1541–1546 (2011)
15.
Zurück zum Zitat Harel, M., Mannor, S.: Learning from Multiple Outlooks. arXiv (2011) Harel, M., Mannor, S.: Learning from Multiple Outlooks. arXiv (2011)
16.
Zurück zum Zitat Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)CrossRef Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)CrossRef
17.
Zurück zum Zitat Shu, M.: Deep learning for image classification on very small datasets using transfer learning using transfer learning. Creat. Compon. 345 (2019) Shu, M.: Deep learning for image classification on very small datasets using transfer learning using transfer learning. Creat. Compon. 345 (2019)
18.
Zurück zum Zitat Zhao, W.: Research on the deep learning of the small sample data based on transfer learning. AIP Conf. Proc. 1864, 020018 (2017)CrossRef Zhao, W.: Research on the deep learning of the small sample data based on transfer learning. AIP Conf. Proc. 1864, 020018 (2017)CrossRef
19.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)CrossRef Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)CrossRef
20.
Zurück zum Zitat Sharma, A., Vans, E., Shigemizu, D., Boroevich, K.A., Tsunoda, T.: DeepInsight: a methodology to transform a non-image data to an image for convolution neural network architecture. Sci. Rep. 9(1), 11399 (2019)CrossRef Sharma, A., Vans, E., Shigemizu, D., Boroevich, K.A., Tsunoda, T.: DeepInsight: a methodology to transform a non-image data to an image for convolution neural network architecture. Sci. Rep. 9(1), 11399 (2019)CrossRef
21.
Zurück zum Zitat Ma, S., Zhang, Z.: OmicsMapNet: Transforming omics data to take advantage of Deep Convolutional Neural Network for discovery. arXiv (2018) Ma, S., Zhang, Z.: OmicsMapNet: Transforming omics data to take advantage of Deep Convolutional Neural Network for discovery. arXiv (2018)
22.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016)
23.
Zurück zum Zitat Smith, L.N.: Cyclical learning rates for training neural networks. In: 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 464–472 (2017) Smith, L.N.: Cyclical learning rates for training neural networks. In: 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 464–472 (2017)
24.
Zurück zum Zitat Smith, L.N.: A Disciplined Approach to Neural Network Hyper-Parameters: Part 1. Learning Rate, Batch Size, Momentum, and Weight Decay. arXiv (2018) Smith, L.N.: A Disciplined Approach to Neural Network Hyper-Parameters: Part 1. Learning Rate, Batch Size, Momentum, and Weight Decay. arXiv (2018)
25.
Zurück zum Zitat Larabi-Marie-Sainte, S., Aburahmah, L., Almohaini, R., Saba, T.: Current techniques for diabetes prediction: review and case study. Appl. Sci. 9(21), 4604 (2019)CrossRef Larabi-Marie-Sainte, S., Aburahmah, L., Almohaini, R., Saba, T.: Current techniques for diabetes prediction: review and case study. Appl. Sci. 9(21), 4604 (2019)CrossRef
27.
Zurück zum Zitat Khashman, A., Ebenezer, O., Oyedot, O., Munawar, S., Olaniyi, E.O., Adnan, K.: Onset diabetes diagnosis using artificial neural network. Int. J. Sci. Eng. Res. 5(10), 754–759 (2014) Khashman, A., Ebenezer, O., Oyedot, O., Munawar, S., Olaniyi, E.O., Adnan, K.: Onset diabetes diagnosis using artificial neural network. Int. J. Sci. Eng. Res. 5(10), 754–759 (2014)
28.
Zurück zum Zitat Soltani, Z., Jafarian, A.: A new artificial neural networks approach for diagnosing diabetes disease type II. Int. J. Adv. Comput. Sci. Appl. 7(6), 89–94 (2016) Soltani, Z., Jafarian, A.: A new artificial neural networks approach for diagnosing diabetes disease type II. Int. J. Adv. Comput. Sci. Appl. 7(6), 89–94 (2016)
Metadaten
Titel
Deep Learning on Small Tabular Dataset: Using Transfer Learning and Image Classification
verfasst von
Vanshika Jain
Meghansh Goel
Kshitiz Shah
Copyright-Jahr
2022
DOI
https://doi.org/10.1007/978-3-030-95711-7_46

Premium Partner