Skip to main content
Top
Published in: Neural Processing Letters 4/2022

02-02-2022

Domain Adaptation for POS Tagging with Contrastive Monotonic Chunk-wise Attention

Authors: Rajesh Kumar Mundotiya, Arpit Mehta, Rupjyoti Baruah

Published in: Neural Processing Letters | Issue 4/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Part of Speech (POS) tagging is a sequential labelling task and one of the core applications of Natural Language Processing. It has been a challenging problem for the low resource languages. Sequential labelling algorithms aim to model relationships among the words of a sentence. Availability of annotated datasets in ample amounts is another challenge for low resource languages. Contrastive training has been tried as a robust approach that captures the essential features during model training and based on this, Contrastive Monotonic Chunk-wise attention with CNN-GRU-Softmax (CMCCGS) model architecture has been proposed for POS tagging. It learns optimal features in a low resource regime. It comprises three components: contrastive training, monotonic chunk-wise attention and CNN-GRU-Softmax, where Monotonic Chunk-wise attention exploits the discrete and chunk level dependencies. We experimented on the datasets of four domains, Article, Conversation, Disease and Tourism, of the Hindi treebank, Tweet domain from TweeBank, Newswire domain from Penn TreeBank (PTB) and Tweet domain from ARK and compared it with several state-of-the-art models. We have obtained \(96.63\%\), \(94.34\%\), \(91.24\%\), \(93.76\%\), \(92.30\%\), \(97.51\%\) and \(93.55\%\) accuracy on respective domains after CMCCGS has been applied. CMCCGS model has been further extended to domain adaptation by using single and multi-source domain adaptation to allow fine-tuning. It is analysed the effects on different layers. The extremely low resource domains such as Tourism, Disease and tweet domain of TweeBank and ARK have shown improvement in accuracy of \(+3.00\% (96.76\%)\) by an Article domain, \(+4.14\% (95.38\%)\) by Article and Tourism (multi-source), \(+2.93\% (95.23\%)\) by PTB domain and \(+1.43\% (94.98\%)\) by PTB and TweeBank (multi-source) as source domain, respectively. However, the Conversation domain has a negative impact on domain adaptation.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bhat RA, Bhatt R, Farudi A, Klassen P, Narasimhan B, Palmer M, Rambow O, Sharma DM, Vaidya A, Vishnu SR et al (2017) The hindi/urdu treebank project. In: Handbook of Linguistic Annotation. Springer, pp 659–697 Bhat RA, Bhatt R, Farudi A, Klassen P, Narasimhan B, Palmer M, Rambow O, Sharma DM, Vaidya A, Vishnu SR et al (2017) The hindi/urdu treebank project. In: Handbook of Linguistic Annotation. Springer, pp 659–697
2.
go back to reference Blitzer J, McDonald R, Pereira F (2006) Domain adaptation with structural correspondence learning. In: Proceedings of the 2006 conference on empirical methods in natural language processing, pp 120–128 Blitzer J, McDonald R, Pereira F (2006) Domain adaptation with structural correspondence learning. In: Proceedings of the 2006 conference on empirical methods in natural language processing, pp 120–128
3.
go back to reference Chelba C, Acero A (2006) Adaptation of maximum entropy capitalizer: little data can help a lot. Comput Speech Lang 20(4):382–399CrossRef Chelba C, Acero A (2006) Adaptation of maximum entropy capitalizer: little data can help a lot. Comput Speech Lang 20(4):382–399CrossRef
5.
go back to reference Cho K, van Merrienboer B, Gülçehre Ç, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using RNN encoder-decoder for statistical machine translation. In: Moschitti A, Pang B, Daelemans W (eds) Proceedings of the 2014 conference on empirical methods in natural language processing, EMNLP 2014, October 25–29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL, pp 1724–1734. ACL . https://doi.org/10.3115/v1/d14-1179 Cho K, van Merrienboer B, Gülçehre Ç, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using RNN encoder-decoder for statistical machine translation. In: Moschitti A, Pang B, Daelemans W (eds) Proceedings of the 2014 conference on empirical methods in natural language processing, EMNLP 2014, October 25–29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL, pp 1724–1734. ACL . https://​doi.​org/​10.​3115/​v1/​d14-1179
6.
go back to reference Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P (2011) Natural language processing (almost) from scratch. J Mach Learn Res 12:2493–2537MATH Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P (2011) Natural language processing (almost) from scratch. J Mach Learn Res 12:2493–2537MATH
7.
8.
go back to reference Daumé III H, Kumar A, Saha A (2010) Frustratingly easy semi-supervised domain adaptation. In: Proceedings of the 2010 workshop on domain adaptation for natural language processing. Association for Computational Linguistics, pp 53–59 Daumé III H, Kumar A, Saha A (2010) Frustratingly easy semi-supervised domain adaptation. In: Proceedings of the 2010 workshop on domain adaptation for natural language processing. Association for Computational Linguistics, pp 53–59
9.
go back to reference Daumé III H, Marcu D (2005) Learning as search optimization: approximate large margin methods for structured prediction. In: Proceedings of the 22nd international conference on machine learning, pp 169–176 Daumé III H, Marcu D (2005) Learning as search optimization: approximate large margin methods for structured prediction. In: Proceedings of the 22nd international conference on machine learning, pp 169–176
10.
go back to reference Du C, Sun H, Wang J, Qi Q, Liao J (2020) Adversarial and domain-aware bert for cross-domain sentiment analysis. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 4019–4028 Du C, Sun H, Wang J, Qi Q, Liao J (2020) Adversarial and domain-aware bert for cross-domain sentiment analysis. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 4019–4028
11.
go back to reference Ferraro JP, Daumé H III, DuVall SL, Chapman WW, Harkema H, Haug PJ (2013) Improving performance of natural language processing part-of-speech tagging on clinical narratives through domain adaptation. J Am Med Inf Assoc 20(5):931–939CrossRef Ferraro JP, Daumé H III, DuVall SL, Chapman WW, Harkema H, Haug PJ (2013) Improving performance of natural language processing part-of-speech tagging on clinical narratives through domain adaptation. J Am Med Inf Assoc 20(5):931–939CrossRef
12.
go back to reference Globerson A, Roweis S (2006) Nightmare at test time: robust learning by feature deletion. In: Proceedings of the 23rd international conference on Machine learning, pp 353–360 Globerson A, Roweis S (2006) Nightmare at test time: robust learning by feature deletion. In: Proceedings of the 23rd international conference on Machine learning, pp 353–360
15.
go back to reference Huang F, Yates A (2009) Distributional representations for handling sparsity in supervised sequence-labeling. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP. Association for Computational Linguistics, vol 1, pp 495–503 Huang F, Yates A (2009) Distributional representations for handling sparsity in supervised sequence-labeling. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP. Association for Computational Linguistics, vol 1, pp 495–503
16.
go back to reference Huang F, Yates A (2010) Exploring representation-learning approaches to domain adaptation. In: Proceedings of the 2010 workshop on domain adaptation for natural language processing. Association for Computational Linguistics, pp 23–30 Huang F, Yates A (2010) Exploring representation-learning approaches to domain adaptation. In: Proceedings of the 2010 workshop on domain adaptation for natural language processing. Association for Computational Linguistics, pp 23–30
17.
go back to reference Kim Y (2014) Convolutional neural networks for sentence classification. In: A. Moschitti, B. Pang, W. Daelemans (eds.) Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 25-29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL. ACL, pp 1746–1751. https://doi.org/10.3115/v1/d14-1181 Kim Y (2014) Convolutional neural networks for sentence classification. In: A. Moschitti, B. Pang, W. Daelemans (eds.) Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 25-29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL. ACL, pp 1746–1751. https://​doi.​org/​10.​3115/​v1/​d14-1181
18.
go back to reference Kruengkrai C, Uchimoto K, Kazama J, Wang Y, Torisawa K, Isahara H (2009) An error-driven word-character hybrid model for joint chinese word segmentation and pos tagging. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP. Association for Computational Linguistics, vol 1, pp 513–521 Kruengkrai C, Uchimoto K, Kazama J, Wang Y, Torisawa K, Isahara H (2009) An error-driven word-character hybrid model for joint chinese word segmentation and pos tagging. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP. Association for Computational Linguistics, vol 1, pp 513–521
19.
go back to reference Liu K, Chapman W, Hwa R, Crowley RS (2007) Heuristic sample selection to minimize reference standard training set for a part-of-speech tagger. J Am Med Inf Assoc 14(5):641–650CrossRef Liu K, Chapman W, Hwa R, Crowley RS (2007) Heuristic sample selection to minimize reference standard training set for a part-of-speech tagger. J Am Med Inf Assoc 14(5):641–650CrossRef
21.
go back to reference Liu Y, Zhang Y (2012) Unsupervised domain adaptation for joint segmentation and pos-tagging. In: Proceedings of COLING 2012: Posters, pp 745–754 Liu Y, Zhang Y (2012) Unsupervised domain adaptation for joint segmentation and pos-tagging. In: Proceedings of COLING 2012: Posters, pp 745–754
22.
go back to reference Liu Y, Zhu Y, Che W, Qin B, Schneider N, Smith NA (2018) Parsing tweets into universal dependencies. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1 (Long Papers), pp 965–975 Liu Y, Zhu Y, Che W, Qin B, Schneider N, Smith NA (2018) Parsing tweets into universal dependencies. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1 (Long Papers), pp 965–975
24.
go back to reference Marcus MP, Marcinkiewicz MA, Santorini B (1993) Building a large annotated corpus of english: the penn treebank. Comput Linguist 19(2):313–330 Marcus MP, Marcinkiewicz MA, Santorini B (1993) Building a large annotated corpus of english: the penn treebank. Comput Linguist 19(2):313–330
25.
go back to reference März L, Trautmann D, Roth B (2019) Domain adaptation for part-of-speech tagging of noisy user-generated text. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1 (Long and Short Papers), pp 3415–3420 März L, Trautmann D, Roth B (2019) Domain adaptation for part-of-speech tagging of noisy user-generated text. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1 (Long and Short Papers), pp 3415–3420
26.
go back to reference Meftah S, Semmar N (2018) A neural network model for part-of-speech tagging of social media texts. In: Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018) Meftah S, Semmar N (2018) A neural network model for part-of-speech tagging of social media texts. In: Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018)
27.
go back to reference Miller J, Torii M, Vijay-Shanker K (2007) Adaptation of pos tagging for multiple biomedical domains. In: Biological, translational, and clinical language processing, pp 179–180 Miller J, Torii M, Vijay-Shanker K (2007) Adaptation of pos tagging for multiple biomedical domains. In: Biological, translational, and clinical language processing, pp 179–180
28.
go back to reference Miller JE, Bloodgood M, Torii M, Vijay-Shanker K (2006) Rapid adaptation of pos tagging for domain specific uses. In: Proceedings of the HLT-NAACL BioNLP workshop on linking natural language and biology. Association for Computational Linguistics, pp 118–119 Miller JE, Bloodgood M, Torii M, Vijay-Shanker K (2006) Rapid adaptation of pos tagging for domain specific uses. In: Proceedings of the HLT-NAACL BioNLP workshop on linking natural language and biology. Association for Computational Linguistics, pp 118–119
29.
go back to reference Miyato T, Dai AM, Goodfellow I (2017) Adversarial training methods for semi-supervised text classification. In: ICLR Miyato T, Dai AM, Goodfellow I (2017) Adversarial training methods for semi-supervised text classification. In: ICLR
30.
go back to reference Owoputi O, O’Connor B, Dyer C, Gimpel K, Schneider N, Smith NA (2013) Improved part-of-speech tagging for online conversational text with word clusters. In: Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 380–390 Owoputi O, O’Connor B, Dyer C, Gimpel K, Schneider N, Smith NA (2013) Improved part-of-speech tagging for online conversational text with word clusters. In: Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 380–390
31.
go back to reference Pan SJ, Yang Q (2009) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359CrossRef Pan SJ, Yang Q (2009) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359CrossRef
33.
go back to reference Schnabel T, Schütze H (2013) Towards robust cross-domain domain adaptation for part-of-speech tagging. In: Proceedings of the sixth international joint conference on natural language processing. Asian Federation of Natural Language Processing, Nagoya, Japan, pp 198–206. https://www.aclweb.org/anthology/I13-1023 Schnabel T, Schütze H (2013) Towards robust cross-domain domain adaptation for part-of-speech tagging. In: Proceedings of the sixth international joint conference on natural language processing. Asian Federation of Natural Language Processing, Nagoya, Japan, pp 198–206. https://​www.​aclweb.​org/​anthology/​I13-1023
34.
go back to reference Schnabel T, Schütze H (2014) Flors: fast and simple domain adaptation for part-of-speech tagging. Trans Assoc Comput Linguist 2:15–26CrossRef Schnabel T, Schütze H (2014) Flors: fast and simple domain adaptation for part-of-speech tagging. Trans Assoc Comput Linguist 2:15–26CrossRef
35.
go back to reference Søgaard A (2013) Part-of-speech tagging with antagonistic adversaries. In: Proceedings of the 51st annual meeting of the association for computational linguistics (vol 2: Short Papers), pp 640–644 Søgaard A (2013) Part-of-speech tagging with antagonistic adversaries. In: Proceedings of the 51st annual meeting of the association for computational linguistics (vol 2: Short Papers), pp 640–644
38.
go back to reference Xiao M, Guo Y (2013) Domain adaptation for sequence labeling tasks with a probabilistic language adaptation model. In: International conference on machine learning, pp 293–301 Xiao M, Guo Y (2013) Domain adaptation for sequence labeling tasks with a probabilistic language adaptation model. In: International conference on machine learning, pp 293–301
40.
go back to reference Zennaki O, Semmar N, Besacier L (2019) A neural approach for inducing multilingual resources and natural language processing tools for low-resource languages. Nat Lang Eng 25(1):43–67CrossRef Zennaki O, Semmar N, Besacier L (2019) A neural approach for inducing multilingual resources and natural language processing tools for low-resource languages. Nat Lang Eng 25(1):43–67CrossRef
41.
go back to reference Zhang M, Zhang Y, Che W, Liu T (2014) Type-supervised domain adaptation for joint segmentation and pos-tagging. In: Proceedings of the 14th conference of the European chapter of the association for computational linguistics, pp 588–597 Zhang M, Zhang Y, Che W, Liu T (2014) Type-supervised domain adaptation for joint segmentation and pos-tagging. In: Proceedings of the 14th conference of the European chapter of the association for computational linguistics, pp 588–597
Metadata
Title
Domain Adaptation for POS Tagging with Contrastive Monotonic Chunk-wise Attention
Authors
Rajesh Kumar Mundotiya
Arpit Mehta
Rupjyoti Baruah
Publication date
02-02-2022
Publisher
Springer US
Published in
Neural Processing Letters / Issue 4/2022
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-022-10746-4

Other articles of this Issue 4/2022

Neural Processing Letters 4/2022 Go to the issue