Skip to main content
Top

2021 | OriginalPaper | Chapter

Data Collection Design for Dialogue Systems for Low-Resource Languages

Authors : Zulipiye Yusupujiang, Jonathan Ginzburg

Published in: Conversational Dialogue Systems for the Next Decade

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper presents our plan and initial design for constructing a dialogue corpus for a low resource language, in this case Uyghur, with the ultimate goal of developing a dialogue system for Uyghur. We plan to design and create a Massively multiplayer online role-playing game (MMORPG), using the RPG Maker MV Game Engine. We also introduce our initial design of a method for collecting various types of naturally generated questions and answers from native Uyghur speakers. Our method and the design of the game can be used for other low resource languages for collecting a large amount of dialogue data, which is crucial for implementing a dialogue system for such languages.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Rouzi A, Yin S, Zhang Z, Wang D, Hamdulla A, Zheng F (2017) Thuyg-20: a free uyghur speech database. J Tsinghua Univ (Sci Technol) 57(2):182–187 Rouzi A, Yin S, Zhang Z, Wang D, Hamdulla A, Zheng F (2017) Thuyg-20: a free uyghur speech database. J Tsinghua Univ (Sci Technol) 57(2):182–187
3.
go back to reference Ho C-J, Chang T-H, Lee J-C, Hsu JY, Chen K-T (2009) Kisskissban: a competitive human computation game for image annotation. In Proceedings of the ACM SIGKDD workshop on human computation, pp 11–14 Ho C-J, Chang T-H, Lee J-C, Hsu JY, Chen K-T (2009) Kisskissban: a competitive human computation game for image annotation. In Proceedings of the ACM SIGKDD workshop on human computation, pp 11–14
4.
go back to reference Poesio M, Chamberlain J, Kruschwitz U, Robaldo L, Ducceschi L (2013) Phrase detectives: utilizing collective intelligence for internet-scale language resource creation. ACM Trans Interact Intell Syst (TiiS) 3(1):1–44CrossRef Poesio M, Chamberlain J, Kruschwitz U, Robaldo L, Ducceschi L (2013) Phrase detectives: utilizing collective intelligence for internet-scale language resource creation. ACM Trans Interact Intell Syst (TiiS) 3(1):1–44CrossRef
5.
go back to reference Bartle RA (2004) Designing virtual worlds. New Riders Bartle RA (2004) Designing virtual worlds. New Riders
6.
go back to reference Healey PGT, Purver M, King J, Ginzburg J, Mills G (2003) Experimenting with clarification in dialogue. In: Alterman R, Kirsh D (eds) Proceedings of the 25th annual conference of the cognitive science society, LEA, Mahwah, N.J., pp 539–544 Healey PGT, Purver M, King J,  Ginzburg J, Mills G (2003) Experimenting with clarification in dialogue. In: Alterman R, Kirsh D (eds) Proceedings of the 25th annual conference of the cognitive science society, LEA, Mahwah, N.J., pp 539–544
7.
go back to reference Eshghi A, Healey PGT (2016) Collective contexts in conversation: grounding by proxy. Cogn Sci 40(2):299–324CrossRef Eshghi A, Healey PGT (2016) Collective contexts in conversation: grounding by proxy. Cogn Sci 40(2):299–324CrossRef
8.
go back to reference Łupkowski P, Ginzburg J (2016) Query responses. J Lang Modell 4(2):245–292CrossRef Łupkowski P, Ginzburg J (2016) Query responses. J Lang Modell 4(2):245–292CrossRef
9.
10.
go back to reference Ginzburg J, Yusupujiang Z, Li C, Ren K, Łupkowski P (2019) Characterizing the response space of questions: a corpus study for English and polish. In Proceedings of the 20th annual SIGdial meeting on discourse and dialogue, pp 320–330 Ginzburg J, Yusupujiang Z, Li C, Ren K, Łupkowski P (2019) Characterizing the response space of questions: a corpus study for English and polish. In Proceedings of the 20th annual SIGdial meeting on discourse and dialogue, pp 320–330
11.
go back to reference Larsson S, Berman A (2016) Domain-specific and general syntax and semantics in the talkamatic dialogue manager. Empir Issues Syntax Sem 11:91–110 Larsson S, Berman A (2016) Domain-specific and general syntax and semantics in the talkamatic dialogue manager. Empir Issues Syntax Sem 11:91–110
12.
go back to reference Maraev V, Ginzburg J, Larsson S, Tian Y, Bernardy J-P (2018) Towards KOS/TTR-based proof-theoretic dialogue management. In: Proceedings of SemDial Maraev V, Ginzburg J, Larsson S, Tian Y, Bernardy J-P (2018) Towards KOS/TTR-based proof-theoretic dialogue management. In: Proceedings of SemDial
Metadata
Title
Data Collection Design for Dialogue Systems for Low-Resource Languages
Authors
Zulipiye Yusupujiang
Jonathan Ginzburg
Copyright Year
2021
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-15-8395-7_30