Skip to main content
Top

2018 | OriginalPaper | Chapter

Text Rewriting Pattern Mining Based on Monolingual Alignment

Authors : Yuxiang Jia, Lu Wang, Hongying Zan

Published in: Chinese Lexical Semantics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Text rewriting pattern mining was important for stylistic change detection and machine (aided) writing. This paper combined monolingual sentence alignment and monolingual word alignment for text rewriting pattern mining. Edit distance was used to compute sentence similarity for sentence alignment, and a log-linear modification of IBM Model 2 was used for word alignment. We built a rewriting corpus of Jin Yong’s novels, on which quantitative and qualitative experiments were carried out. Rewriting patterns were extracted and classified, including function word usages and some content word usages, which reflected the stylistic shift of the author.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Ho, Y.: Corpus Stylistics in Principles and Practice: A Stylistic Exploration of John Fowles’ The Magus. A&C Black (2011) Ho, Y.: Corpus Stylistics in Principles and Practice: A Stylistic Exploration of John Fowles’ The Magus. A&C Black (2011)
2.
go back to reference Lan, D.H., Cao, L.Y.: On the new revised version of Jin Yong’s novels. J. Hangzhou Dianzi Univ. ( Soc. Sci.) 6(1), 57–61 (2010). (in Chinese) Lan, D.H., Cao, L.Y.: On the new revised version of Jin Yong’s novels. J. Hangzhou Dianzi Univ. ( Soc. Sci.) 6(1), 57–61 (2010). (in Chinese)
3.
go back to reference Xue, D.C.: The edition research of the legend of the condor heroes. Henan University (2011). (in Chinese) Xue, D.C.: The edition research of the legend of the condor heroes. Henan University (2011). (in Chinese)
4.
go back to reference Zhang, F., Litman, D.: Sentence-level rewriting detection. In: Proceedings of the Ninth Workshop on Innovative Use of NLP for Building Educational Applications, pp. 149–154 (2014) Zhang, F., Litman, D.: Sentence-level rewriting detection. In: Proceedings of the Ninth Workshop on Innovative Use of NLP for Building Educational Applications, pp. 149–154 (2014)
5.
go back to reference Zhang, F., Hashemi, H.B., Hwa, R., et al.: A corpus of annotated revisions for studying argumentative writing. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1568–1578 (2017) Zhang, F., Hashemi, H.B., Hwa, R., et al.: A corpus of annotated revisions for studying argumentative writing. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1568–1578 (2017)
6.
go back to reference Tan, P.P., Verspoor, K., Miller, T.: Structural alignment as the basis to improve significant change detection in versioned sentences. In: Proceedings of the Australasian Language Technology Association Workshop 2015, pp. 101–109 (2015) Tan, P.P., Verspoor, K., Miller, T.: Structural alignment as the basis to improve significant change detection in versioned sentences. In: Proceedings of the Australasian Language Technology Association Workshop 2015, pp. 101–109 (2015)
7.
go back to reference Navarro, G.: A guided tour to approximate string matching. ACM Comput. Surv. 33(1), 31–88 (2001)CrossRef Navarro, G.: A guided tour to approximate string matching. ACM Comput. Surv. 33(1), 31–88 (2001)CrossRef
8.
go back to reference Nelken, R., Shieber, S.M.: Towards robust context-sensitive sentence alignment for monolingual corpora. In: Proceedings of EACL 2006, pp. 161–168 (2006) Nelken, R., Shieber, S.M.: Towards robust context-sensitive sentence alignment for monolingual corpora. In: Proceedings of EACL 2006, pp. 161–168 (2006)
9.
go back to reference Barzilay, R., Elhadad, N.: Sentence alignment for monolingual comparable corpora. In: Proceedings of EMNLP 2003, pp. 25–32 (2003) Barzilay, R., Elhadad, N.: Sentence alignment for monolingual comparable corpora. In: Proceedings of EMNLP 2003, pp. 25–32 (2003)
10.
go back to reference Liu, Z.Y., Wang, H.F., Wu, H., et al.: Collocation extraction using monolingual word alignment method. In: Proceedings of EMNLP 2009, vol. 2, pp. 487–495 (2009) Liu, Z.Y., Wang, H.F., Wu, H., et al.: Collocation extraction using monolingual word alignment method. In: Proceedings of EMNLP 2009, vol. 2, pp. 487–495 (2009)
11.
go back to reference Dyer, C., Chahuneau, V., Smith, N.A.: A simple, fast, and effective reparameterization of IBM model 2. In: Proceedings of NAACL 2013, pp. 644–648 (2013) Dyer, C., Chahuneau, V., Smith, N.A.: A simple, fast, and effective reparameterization of IBM model 2. In: Proceedings of NAACL 2013, pp. 644–648 (2013)
Metadata
Title
Text Rewriting Pattern Mining Based on Monolingual Alignment
Authors
Yuxiang Jia
Lu Wang
Hongying Zan
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-04015-4_47

Premium Partner