Skip to main content
Erschienen in: Soft Computing 5/2020

05.06.2019 | Methodologies and Application

Hyperparameter tuning in convolutional neural networks for domain adaptation in sentiment classification (HTCNN-DASC)

verfasst von: K. Krishnakumari, E. Sivasankar, Sam Radhakrishnan

Erschienen in: Soft Computing | Ausgabe 5/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In-domain adaptation (DA), the knowledge trained in one domain, is used to test an unknown domain. Existing approaches use limited efforts on DA in sentiment classification (SC) using neural networks. The challenging task here is the dissimilarity in the semantic behavior across domains. In this paper, convolutional neural networks (CNNs) learn the knowledge of a particular domain using Doc2Vec feature representation which provides good performance for DA in SC for the target domain. Our empirical analysis with one-layer CNN exhibits significant change in the accuracy by tuning the hyperparameters involved with the CNN. This paper derives into a suitable CNN architecture accompanying hyperparameters which favor DA between different domains. Our empirical analysis with multi-domain dataset demonstrates that with suitable hyperparameters, CNN works well for DASC problems. The comparative study shows that CNN with Doc2Vec model provides a strong capability of learning large data representation semantically with other state-of-the-art methods for the DASC.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Blitzer J, Dredze M, Pereira F et al (2007) Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. ACL 7:440–447 Blitzer J, Dredze M, Pereira F et al (2007) Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. ACL 7:440–447
Zurück zum Zitat Bollegala D, Weir D, Carroll J (2013) Cross-domain sentiment classification using a sentiment sensitive thesaurus. IEEE Trans Knowl Data Eng 25:1719–1731CrossRef Bollegala D, Weir D, Carroll J (2013) Cross-domain sentiment classification using a sentiment sensitive thesaurus. IEEE Trans Knowl Data Eng 25:1719–1731CrossRef
Zurück zum Zitat Dai W, Yang Q, Xue GR, Yu Y (2007) Boosting for transfer learning. In: Proceedings of the 24th international conference on machine learning. ACM, pp 193–200 Dai W, Yang Q, Xue GR, Yu Y (2007) Boosting for transfer learning. In: Proceedings of the 24th international conference on machine learning. ACM, pp 193–200
Zurück zum Zitat Dai AM, Olah C, Le QV (2015) Document embedding with paragraph vectors. In: NIPS deep learning workshop Dai AM, Olah C, Le QV (2015) Document embedding with paragraph vectors. In: NIPS deep learning workshop
Zurück zum Zitat Glorot X, Bordes A, Bengio Y (2011) Domain adaptation for large-scale sentiment classification: a deep learning approach. In: Proceedings of the 28th international conference on machine learning (ICML-11), pp 513–520 Glorot X, Bordes A, Bengio Y (2011) Domain adaptation for large-scale sentiment classification: a deep learning approach. In: Proceedings of the 28th international conference on machine learning (ICML-11), pp 513–520
Zurück zum Zitat He Y, Lin C, Alani H (2011) Automatically extracting polarity-bearing topics for cross-domain sentiment classification. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies-Volume 1. Association for Computational Linguistics, pp 123–131 He Y, Lin C, Alani H (2011) Automatically extracting polarity-bearing topics for cross-domain sentiment classification. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies-Volume 1. Association for Computational Linguistics, pp 123–131
Zurück zum Zitat Jain V, Learned-Miller E (2011) Online domain adaptation of a pre-trained cascade of classifiers. In: 2011 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 577–584 Jain V, Learned-Miller E (2011) Online domain adaptation of a pre-trained cascade of classifiers. In: 2011 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 577–584
Zurück zum Zitat Johnson R, Zhang T (2015) Effective use of word order for text categorization with convolutional neural networks. In: Proceedings of the 2015 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 103–112 Johnson R, Zhang T (2015) Effective use of word order for text categorization with convolutional neural networks. In: Proceedings of the 2015 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 103–112
Zurück zum Zitat Kalchbrenner N, Grefenstette E, Blunsom P (2014) A convolutional neural network for modelling sentences. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Baltimore, Maryland, pp 655–665. http://www.aclweb.org/anthology/P/P14/P14-1062. Accessed 4 Dec 2016 Kalchbrenner N, Grefenstette E, Blunsom P (2014) A convolutional neural network for modelling sentences. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Baltimore, Maryland, pp 655–665. http://​www.​aclweb.​org/​anthology/​P/​P14/​P14-1062. Accessed 4 Dec 2016
Zurück zum Zitat Kandaswamy C, Silva LM, Alexandre LA, Santos JM, de Sá JM (2014) Improving deep neural network performance by reusing features trained with transductive transference. In: International conference on artificial neural networks. Springer, pp 265–272 Kandaswamy C, Silva LM, Alexandre LA, Santos JM, de Sá JM (2014) Improving deep neural network performance by reusing features trained with transductive transference. In: International conference on artificial neural networks. Springer, pp 265–272
Zurück zum Zitat Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the 2014 conference on empirical methods in natural language processing, EMNLP 2014, October 25–29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL, pp 1746–1751. http://aclweb.org/anthology/D/D14/D14-1181.pdf. Accessed 12 Feb 2017 Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the 2014 conference on empirical methods in natural language processing, EMNLP 2014, October 25–29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL, pp 1746–1751. http://​aclweb.​org/​anthology/​D/​D14/​D14-1181.​pdf. Accessed 12 Feb 2017
Zurück zum Zitat Le Q, Mikolov T (2014) Distributed representations of sentences and documents. In: Proceedings of the 31st international conference on machine learning (ICML-14), pp 1188–1196 Le Q, Mikolov T (2014) Distributed representations of sentences and documents. In: Proceedings of the 31st international conference on machine learning (ICML-14), pp 1188–1196
Zurück zum Zitat Li L, Qin B, Ren W, Liu T (2017) Document representation and feature combination for deceptive spam review detection. Neurocomputing 254:33–41CrossRef Li L, Qin B, Ren W, Liu T (2017) Document representation and feature combination for deceptive spam review detection. Neurocomputing 254:33–41CrossRef
Zurück zum Zitat Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. In: Proceedings of the international conference on learning representations (ICLR) 2013 Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. In: Proceedings of the international conference on learning representations (ICLR) 2013
Zurück zum Zitat Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22:1345–1359CrossRef Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22:1345–1359CrossRef
Zurück zum Zitat Pan W, Zhong E, Yang Q (2012) Transfer learning for text mining. In: Aggarwal C, Zhai C (eds) Mining text data. Springer, Boston, MA, pp 223–257CrossRef Pan W, Zhong E, Yang Q (2012) Transfer learning for text mining. In: Aggarwal C, Zhai C (eds) Mining text data. Springer, Boston, MA, pp 223–257CrossRef
Zurück zum Zitat Rojas-Barahona LM (2016) Deep learning for sentiment analysis. Lang Linguist Compass 10(12):701–719CrossRef Rojas-Barahona LM (2016) Deep learning for sentiment analysis. Lang Linguist Compass 10(12):701–719CrossRef
Zurück zum Zitat Rosenstein MT, Marx Z, Kaelbling LP, Dietterich TG (2005) To transfer or not to transfer. In: NIPS 2005 workshop on transfer learning, vol 898 Rosenstein MT, Marx Z, Kaelbling LP, Dietterich TG (2005) To transfer or not to transfer. In: NIPS 2005 workshop on transfer learning, vol 898
Zurück zum Zitat Sanguansat P (2016) Paragraph2vec-based sentiment analysis on social media for business in Thailand. In: 2016 8th international conference on knowledge and smart technology (KST). IEEE, pp 175–178 Sanguansat P (2016) Paragraph2vec-based sentiment analysis on social media for business in Thailand. In: 2016 8th international conference on knowledge and smart technology (KST). IEEE, pp 175–178
Zurück zum Zitat Shen Y, He X, Gao J, Deng L, Mesnil G (2014) Learning semantic representations using convolutional neural networks for web search. In: Proceedings of the 23rd international conference on world wide web. ACM, pp 373–374 Shen Y, He X, Gao J, Deng L, Mesnil G (2014) Learning semantic representations using convolutional neural networks for web search. In: Proceedings of the 23rd international conference on world wide web. ACM, pp 373–374
Zurück zum Zitat Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: Proceedings of the international conference on learning representations (ICLR) 2015 Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: Proceedings of the international conference on learning representations (ICLR) 2015
Zurück zum Zitat Sun S, Shi H, Wu Y (2015) A survey of multi-source domain adaptation. Inf Fusion 24:84–92CrossRef Sun S, Shi H, Wu Y (2015) A survey of multi-source domain adaptation. Inf Fusion 24:84–92CrossRef
Zurück zum Zitat Tang D, Qin B, Liu T (2015) Deep learning for sentiment analysis: successful approaches and future challenges. Wiley Interdiscip Rev Data Min Knowl Discov 5(6):292–303CrossRef Tang D, Qin B, Liu T (2015) Deep learning for sentiment analysis: successful approaches and future challenges. Wiley Interdiscip Rev Data Min Knowl Discov 5(6):292–303CrossRef
Zurück zum Zitat Tang D, Qin B, Liu T (2016) Aspect level sentiment classification with deep memory network. In: Proceedings of the 2016 conference on empirical methods in natural language processing. ACM, pp 214–224 Tang D, Qin B, Liu T (2016) Aspect level sentiment classification with deep memory network. In: Proceedings of the 2016 conference on empirical methods in natural language processing. ACM, pp 214–224
Zurück zum Zitat Wei X, Lin H, Yu Y, Yang L (2017) Low-resource cross-domain product review sentiment classification based on a cnn with an auxiliary large-scale corpus. Algorithms 10(3):81CrossRef Wei X, Lin H, Yu Y, Yang L (2017) Low-resource cross-domain product review sentiment classification based on a cnn with an auxiliary large-scale corpus. Algorithms 10(3):81CrossRef
Zurück zum Zitat Whitehead M, Yaeger L (2009) Building a general purpose cross-domain sentiment mining model. In: 2009 WRI world congress on computer science and information engineering. IEEE, vol 4, pp 472–476 Whitehead M, Yaeger L (2009) Building a general purpose cross-domain sentiment mining model. In: 2009 WRI world congress on computer science and information engineering. IEEE, vol 4, pp 472–476
Zurück zum Zitat Wu H, Gu Y, Sun S, Gu X (2016) Aspect-based opinion summarization with convolutional neural networks. In: 2016 international joint conference on neural networks (IJCNN). IEEE, pp 3157–3163 Wu H, Gu Y, Sun S, Gu X (2016) Aspect-based opinion summarization with convolutional neural networks. In: 2016 international joint conference on neural networks (IJCNN). IEEE, pp 3157–3163
Zurück zum Zitat Wu F, Huang Y, Yuan Z (2017) Domain-specific sentiment classification via fusing sentiment knowledge from multiple sources. Inf Fusion 35:26–37CrossRef Wu F, Huang Y, Yuan Z (2017) Domain-specific sentiment classification via fusing sentiment knowledge from multiple sources. Inf Fusion 35:26–37CrossRef
Zurück zum Zitat Zeng D, Liu K, Lai S, Zhou G, Zhao J et al (2014) Relation classification via convolutional deep neural network. In: COLING, pp 2335–2344 Zeng D, Liu K, Lai S, Zhou G, Zhao J et al (2014) Relation classification via convolutional deep neural network. In: COLING, pp 2335–2344
Zurück zum Zitat Zhang Y, Wallace B (2017) A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification. In: Proceedings of the 8th international joint conference on natural language processing, AFNLP, pp 253–263 Zhang Y, Wallace B (2017) A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification. In: Proceedings of the 8th international joint conference on natural language processing, AFNLP, pp 253–263
Zurück zum Zitat Zhang S, Liu H, Yang L, Lin H (2015) A cross-domain sentiment classification method based on extraction of key sentiment sentence. In: Li J, Ji H, Zhao D, Feng Y (eds) Natural language processing and Chinese computing. Springer, Cham, pp 90–101CrossRef Zhang S, Liu H, Yang L, Lin H (2015) A cross-domain sentiment classification method based on extraction of key sentiment sentence. In: Li J, Ji H, Zhao D, Feng Y (eds) Natural language processing and Chinese computing. Springer, Cham, pp 90–101CrossRef
Zurück zum Zitat Zhou S, Chen Q, Wang X (2013) Active deep learning method for semi-supervised sentiment classification. Neurocomputing 120:536–546CrossRef Zhou S, Chen Q, Wang X (2013) Active deep learning method for semi-supervised sentiment classification. Neurocomputing 120:536–546CrossRef
Zurück zum Zitat Zhu E, Huang G, Mo B, Wu Q (2016) Features extraction based on neural network for cross-domain sentiment classification. In: International conference on database systems for advanced applications. Springer, pp 81–88 Zhu E, Huang G, Mo B, Wu Q (2016) Features extraction based on neural network for cross-domain sentiment classification. In: International conference on database systems for advanced applications. Springer, pp 81–88
Metadaten
Titel
Hyperparameter tuning in convolutional neural networks for domain adaptation in sentiment classification (HTCNN-DASC)
verfasst von
K. Krishnakumari
E. Sivasankar
Sam Radhakrishnan
Publikationsdatum
05.06.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 5/2020
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-019-04117-w

Weitere Artikel der Ausgabe 5/2020

Soft Computing 5/2020 Zur Ausgabe

Methodologies and Application

Two-dimensional perceptrons

Premium Partner