nach oben

Erschienen in:

2022 | OriginalPaper | Buchkapitel

Deep Learning on Small Tabular Dataset: Using Transfer Learning and Image Classification

verfasst von : Vanshika Jain, Meghansh Goel, Kshitiz Shah

Erschienen in: Artificial Intelligence and Speech Technology

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Deep Learning is a subset of machine learning inspired by the human brain. It uses multiple layers of representation to extract specific knowledge from raw input. It is best suited for large image or sound-based datasets. Deep learning methods are generally avoided for small datasets because they tend to overfit. Transfer learning can be one approach used to solve this problem. However, in the case of tabular datasets, their heterogeneous nature makes transfer learning algorithms inapplicable. This paper aims to discuss a few approaches using a literature review to convert tabular data into images to overcome such limitations. The paper provides a 2-part study wherein we first give a brief overview of transfer learning enhancing the efficiency of deep learning algorithms and drastically reducing the training time for small datasets. Secondly, we provide a detailed study of different techniques available to convert tabular data into images for image classification such as SuperTML, IGTD, and REFINED approach. Furthermore, we propose a novel approach inspired by IGTD to create a blocked image representation of the tabular data on which we apply transfer learning to demonstrate the application of deep learning methods on small tabular datasets (with less than 1000 data points).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel An Analysis of Image Compression Using Neural Network

Nächstes Kapitel Implementation of a Method Using Image Sequentialization, Patch Embedding and ViT Encoder to Detect the Breast Cancer on RGBA Images and Binary Masks

Zhang, D., et al.: The AI Index 2021 Annual Report (2021)

Zhu, Y., et al.: Converting tabular data into images for deep learning with convolutional neural networks. Sci. Rep. 11(1), 11325 (2021). https://doi.org/10.1038/s41598-021-90923-yCrossRef

Bazgir, O., Zhang, R., Dhruba, S.R., Rahman, R., Ghosh, S., Pal, R.: Representation of features as images with neighborhood dependencies for compatibility with convolutional neural networks. Nat. Commun. 11(1), 4391 (2020). https://doi.org/10.1038/s41467-020-18197-yCrossRef

Sun, B., et al.: SuperTML: Two-Dimensional Word Embedding and Transfer Learning Using ImageNet Pretrained CNN Models for the Classifications on Tabular Data (2019)

Weiss, K., Khoshgoftaar, T.M., Wang, D.: A survey of transfer learning. J. Big Data 3(1), 1–40 (2016). https://doi.org/10.1186/s40537-016-0043-6CrossRef

Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., Liu, C.: A Survey on Deep Transfer Learning. arXiv (2018)

Duan, L., Xu, D., Tsang, I.W.: Learning with Augmented Features for Heterogeneous Domain Adaptation. arXiv (2012)

Kulis, B., Saenko, K., Darrell, T.: What you saw is not what you get: domain adaptation using asymmetric kernel transforms. CVPR 2011, 1785–1792 (2011)

Zhu, Y., et al.: Heterogeneous transfer learning for image classification. In: Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, AAAI (2011)

10.

Tianyi Zhou, J., Tsang, I.W., Jialin Pan, S., Tan, M.: Heterogeneous domain adaptation for multiple classes. In: AISTATS (International Conference on Artificial Intelligence and Statistics) (2014)

11.

Zhou, J.T., Pan, S.J., Tsang, I.W., Yan, Y.: Hybrid heterogeneous transfer learning through deep learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 28, no. 1 (2014)

12.

Prettenhofer, P., Stein, B.: Cross-language text classification using structural correspondence learning. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 11–16 (2010)

13.

Nam, J., Kim, S.: Heterogeneous defect prediction. IEEE Trans. Softw. Eng. 44(9), 874–896 (2015)CrossRef

14.

Wang, C., Mahadevan, S.: Heterogeneous domain adaptation using manifold alignment. In: Proceedings of the 22nd International Joint Conference on Artificial Intelligence, pp. 1541–1546 (2011)

15.

Harel, M., Mannor, S.: Learning from Multiple Outlooks. arXiv (2011)

16.

Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)CrossRef

17.

Shu, M.: Deep learning for image classification on very small datasets using transfer learning using transfer learning. Creat. Compon. 345 (2019)

18.

Zhao, W.: Research on the deep learning of the small sample data based on transfer learning. AIP Conf. Proc. 1864, 020018 (2017)CrossRef

19.

Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)CrossRef

20.

Sharma, A., Vans, E., Shigemizu, D., Boroevich, K.A., Tsunoda, T.: DeepInsight: a methodology to transform a non-image data to an image for convolution neural network architecture. Sci. Rep. 9(1), 11399 (2019)CrossRef

21.

Ma, S., Zhang, Z.: OmicsMapNet: Transforming omics data to take advantage of Deep Convolutional Neural Network for discovery. arXiv (2018)

22.

He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016)

23.

Smith, L.N.: Cyclical learning rates for training neural networks. In: 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 464–472 (2017)

24.

Smith, L.N.: A Disciplined Approach to Neural Network Hyper-Parameters: Part 1. Learning Rate, Batch Size, Momentum, and Weight Decay. arXiv (2018)

25.

Larabi-Marie-Sainte, S., Aburahmah, L., Almohaini, R., Saba, T.: Current techniques for diabetes prediction: review and case study. Appl. Sci. 9(21), 4604 (2019)CrossRef

26.

American Diabetes Association: Type 2 diabetes in children and adolescents. Pediatrics 105(3), 671–680 (2000). https://doi.org/10.1542/peds.105.3.671CrossRef

27.

Khashman, A., Ebenezer, O., Oyedot, O., Munawar, S., Olaniyi, E.O., Adnan, K.: Onset diabetes diagnosis using artificial neural network. Int. J. Sci. Eng. Res. 5(10), 754–759 (2014)

28.

Soltani, Z., Jafarian, A.: A new artificial neural networks approach for diagnosing diabetes disease type II. Int. J. Adv. Comput. Sci. Appl. 7(6), 89–94 (2016)

29.

Ashiquzzaman, A., et al.: Reduction of overfitting in diabetes prediction using deep learning neural network. In: Kim, Kuinam J., Kim, Hyuncheol, Baek, Nakhoon (eds.) IT Convergence and Security 2017, pp. 35–43. Springer Singapore, Singapore (2018). https://doi.org/10.1007/978-981-10-6451-7_5CrossRef

30.

Rakshit, S., et al.: Prediction of diabetes Type-II using a two-class neural network. In: Mandal, J.K., Dutta, P., Mukhopadhyay, S. (eds.) CICBA 2017. CCIS, vol. 776, pp. 65–71. Springer, Singapore (2017). https://doi.org/10.1007/978-981-10-6430-2_6CrossRef

Titel: Deep Learning on Small Tabular Dataset: Using Transfer Learning and Image Classification
verfasst von: Vanshika Jain
Meghansh Goel
Kshitiz Shah
Verlag: Springer International Publishing
Buch: Artificial Intelligence and Speech Technology
Print ISBN: 978-3-030-95710-0

Electronic ISBN: 978-3-030-95711-7

Copyright-Jahr: 2022
DOI: https://doi.org/10.1007/978-3-030-95711-7_46

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner