Skip to main content

2021 | OriginalPaper | Buchkapitel

A Novel Distributed Reinforcement Learning Method for Classical Chinese Poetry Generation

verfasst von : Liangliang Ma, Hong Shen, Shangsong Liang

Erschienen in: Parallel and Distributed Computing, Applications and Technologies

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Poetry generation has been a classic natural language generation task recently. But so far the methods for this topic mainly imitate and reproduce the poems on the training data set, which indicates that they either have not much connotation or overfit too much like plagiarism of the existing poems. To solve this problem, unlike previous work, instead of tuning the trade-off between connotation and innovation, we propose a distributed reinforcement learning framework, which consists of two stages of training, to generate creative and meaningful poetry. At the first stage we train a model in parallel on a large poetry corpus at word level to master how poets write poems. At the second stage we train the model with a distributed architecture to learn how connotation is developed in human literary art works at sentence level and force the model to imitate itself when it composes some ‘good poems’ to further improve performance. Experiments on generating classical Chinese poetry demonstrate that the proposed model is able to achieve better performance and the high efficiency of training compared to the state-of-the-art.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
The data sets are publicly available from: https://​github.​com/​chinese-poetry/​chinese-poetry.
 
2
For GPT-based method, User may have to register a Wechat account and add https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-69244-5_3/MediaObjects/504137_1_En_3_Figb_HTML.gif or https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-69244-5_3/MediaObjects/504137_1_En_3_Figc_HTML.gif .
 
3
seqGAN code is available from: https://​github.​com/​LantaoYu/​SeqGAN.
 
5
The Natural Language Processing Group at the Department of Computer Science and Technology, Tsinghua University.
 
Literatur
1.
Zurück zum Zitat Chen, H., Yi, X., Sun, M., Li, W., Yang, C., Guo, Z.: Sentiment-controllable Chinese poetry generation. In: IJCAI, pp. 4925–4931 (2019) Chen, H., Yi, X., Sun, M., Li, W., Yang, C., Guo, Z.: Sentiment-controllable Chinese poetry generation. In: IJCAI, pp. 4925–4931 (2019)
2.
Zurück zum Zitat Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef
3.
Zurück zum Zitat Horgan, D., et al.: Distributed prioritized experience replay (2018) Horgan, D., et al.: Distributed prioritized experience replay (2018)
4.
Zurück zum Zitat Liang, S.: Unsupervised semantic generative adversarial networks for expert retrieval. In: WWW (2019) Liang, S.: Unsupervised semantic generative adversarial networks for expert retrieval. In: WWW (2019)
5.
Zurück zum Zitat Liao, Y., Wang, Y., Liu, Q., Jiang, X.: Gpt-based generation for classical chinese poetry. arXiv preprint arXiv:1907.00151 (2019) Liao, Y., Wang, Y., Liu, Q., Jiang, X.: Gpt-based generation for classical chinese poetry. arXiv preprint arXiv:​1907.​00151 (2019)
6.
Zurück zum Zitat Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. Computer ence (2013) Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. Computer ence (2013)
7.
Zurück zum Zitat Mikolov, T., Karafiát, M., Burget, L., Cernock, J., Khudanpur, S.: Recurrent neural network based language model. In: INTERSPEECH 2010 (2010) Mikolov, T., Karafiát, M., Burget, L., Cernock, J., Khudanpur, S.: Recurrent neural network based language model. In: INTERSPEECH 2010 (2010)
10.
Zurück zum Zitat Radford, A., Narasimhan, K., Salimans, T., Sutskever, I.: Improving language understanding by generative pre-training (2018) Radford, A., Narasimhan, K., Salimans, T., Sutskever, I.: Improving language understanding by generative pre-training (2018)
11.
Zurück zum Zitat Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy optimization algorithms (2017) Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy optimization algorithms (2017)
12.
Zurück zum Zitat Song, Y., Shi, S., Li, J., Zhang, H.: Directional skip-gram: explicitly distinguishing left and right context for word embeddings. In: Proceedings of ACL 2018 (2018) Song, Y., Shi, S., Li, J., Zhang, H.: Directional skip-gram: explicitly distinguishing left and right context for word embeddings. In: Proceedings of ACL 2018 (2018)
14.
Zurück zum Zitat Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press, United States (1998)MATH Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press, United States (1998)MATH
16.
Zurück zum Zitat Yan, R., Jiang, H., Lapata, M., Lin, S.D., Lv, X., Li, X.: I, poet: automatic chinese poetry composition through a generative summarization framework under constrained optimization. In: 23rd IJCAI (2013) Yan, R., Jiang, H., Lapata, M., Lin, S.D., Lv, X., Li, X.: I, poet: automatic chinese poetry composition through a generative summarization framework under constrained optimization. In: 23rd IJCAI (2013)
17.
Zurück zum Zitat Yi, X., Li, R., Sun, M.: Chinese poetry generation with a salient-clue mechanism. CoNLL, 241–250 (2018) Yi, X., Li, R., Sun, M.: Chinese poetry generation with a salient-clue mechanism. CoNLL, 241–250 (2018)
18.
Zurück zum Zitat Yi, X., Sun, M., Li, R., Li, W.: Automatic poetry generation with mutual reinforcement learning. Proc. EMNLP 2018, 3143–3153 (2018) Yi, X., Sun, M., Li, R., Li, W.: Automatic poetry generation with mutual reinforcement learning. Proc. EMNLP 2018, 3143–3153 (2018)
19.
Zurück zum Zitat Yi, X., Sun, M., Li, R., Zonghan, Y.: Chinese poetry generation with a working memory model, September 2018 Yi, X., Sun, M., Li, R., Zonghan, Y.: Chinese poetry generation with a working memory model, September 2018
20.
Zurück zum Zitat Yu, L., Zhang, W., Wang, J., Yu, Y.: Seqgan: sequence generative adversarial nets with policy gradient. In: AAAI-17 (2017) Yu, L., Zhang, W., Wang, J., Yu, Y.: Seqgan: sequence generative adversarial nets with policy gradient. In: AAAI-17 (2017)
21.
Zurück zum Zitat Zhipeng, G., et al.: Jiuge: a human-machine collaborative chinese classical poetry generation system. In: Proceedings of ACL 2019: System Demonstrations, pp. 25–30 (2019) Zhipeng, G., et al.: Jiuge: a human-machine collaborative chinese classical poetry generation system. In: Proceedings of ACL 2019: System Demonstrations, pp. 25–30 (2019)
22.
Zurück zum Zitat Zinkevich, M., Weimer, M., Smola, A.J., Li, L.: Parallelized stochastic gradient descent. In: Proceedings of NIPS 2010 (2011) Zinkevich, M., Weimer, M., Smola, A.J., Li, L.: Parallelized stochastic gradient descent. In: Proceedings of NIPS 2010 (2011)
Metadaten
Titel
A Novel Distributed Reinforcement Learning Method for Classical Chinese Poetry Generation
verfasst von
Liangliang Ma
Hong Shen
Shangsong Liang
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-69244-5_3