Skip to main content

2012 | OriginalPaper | Buchkapitel

3. Data-Driven Methods for Spoken Language Understanding

verfasst von : James Henderson, Filip Jurčíček

Erschienen in: Data-Driven Methods for Adaptive Spoken Dialogue Systems

Verlag: Springer New York

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Spoken dialogue systems need to be able to interpret the spoken input from theuser. This is done by mapping the user’s spoken utterance to a representation ofthe meaning of that utterance, and then passing this representation to thedialogue manager. This process begins with the application of automatic speechrecognition (ASR) technology, which maps the speech to hypotheses about thesequence of words in the utterance. It is then the job of spoken languageunderstanding (SLU) to map the word recognition hypotheses to hypothesisedmeanings. The representation of this meaning is called the semantics of theutterance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Similar experiments are reported in [6], where generative models are reranked with discriminative models, and in [7], where they focus on the advantages of discriminative reranking for small datasets.
 
Literatur
1.
Zurück zum Zitat Bonneau-Maynard, H., Ayache, C., Bechet, F., Denis, A., Kuhn, A., Lefvre, F., Mostefa, D., Qugnard, M., Rosset, S., Servan, J., Vilaneau, S.: Results of the French Evalda-Media evaluation campaign for literal understanding. In: Proceedins of the International Conference on Language Resources and Evaluation (LREC), pp. 2054–2059, 2006 Bonneau-Maynard, H., Ayache, C.,  Bechet, F.,  Denis, A.,  Kuhn, A.,  Lefvre, F., Mostefa, D., Qugnard, M., Rosset, S., Servan, J., Vilaneau, S.: Results of the French Evalda-Media evaluation campaign for literal understanding. In: Proceedins of the International Conference on Language Resources and Evaluation (LREC), pp. 2054–2059, 2006
2.
Zurück zum Zitat Brill, E.: Transformation-based Error-driven Learning and natural language processing: A case study in Part-of-Speech Tagging. Computational Linguistics 21(4), 543–565 (1995) Brill, E.: Transformation-based Error-driven Learning and natural language processing: A case study in Part-of-Speech Tagging. Computational Linguistics 21(4), 543–565 (1995)
3.
Zurück zum Zitat Briscoe, E., Carroll, J., Watson, R.: The second release of the RASP system. In: Proceedings of COLING/ACL, 2006 Briscoe, E., Carroll, J., Watson, R.: The second release of the RASP system. In: Proceedings of COLING/ACL, 2006
4.
Zurück zum Zitat Coppola, B., Moschitti, A., Riccardi, G.: Shallow semantic parsing for spoken language understanding. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, pp. 85–88, 2009 Coppola, B., Moschitti, A., Riccardi, G.: Shallow semantic parsing for spoken language understanding. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, pp. 85–88, 2009
5.
Zurück zum Zitat Dahl, D.A., Bates, M., Brown, M., Fisher, W., Hunicke-Smith, K., Pallett, D., Pao, C., Rudnicky, A., Shriberg, E.: Expanding the scope of the ATIS task: The ATIS-3 corpus. In: Proceedings of the ARPA HLT Workshop, 1994 Dahl, D.A., Bates, M., Brown, M., Fisher, W., Hunicke-Smith,  K., Pallett, D., Pao, C., Rudnicky, A., Shriberg, E.: Expanding the scope of the ATIS task: The ATIS-3 corpus. In: Proceedings of the ARPA HLT Workshop, 1994
6.
Zurück zum Zitat Dinarelli, M., A. Moschitti, Riccardi, G.: Re-ranking models for spoken language understanding. In: Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), pp. 202–210, 2009 Dinarelli, M., A. Moschitti, Riccardi, G.: Re-ranking models for spoken language understanding. In: Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), pp. 202–210, 2009
7.
Zurück zum Zitat Dinarelli, M., Moschitti, A., Riccardi, G.: Re-ranking models based-on small training data for spoken language understanding. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pp. 1076–1085, 2009 Dinarelli, M., Moschitti, A., Riccardi, G.: Re-ranking models based-on small training data for spoken language understanding. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pp. 1076–1085, 2009
8.
Zurück zum Zitat Dinarelli, M., Moschitti, A., Riccardi, G.: Discriminative reranking for spoken language understanding. IEEE Transactions on Audio, Speech, and Language Processing 20(2), 526 –539 (2012) Dinarelli, M., Moschitti, A.,  Riccardi, G.: Discriminative reranking for spoken language understanding. IEEE Transactions on Audio, Speech, and Language Processing 20(2), 526 –539 (2012)
9.
Zurück zum Zitat Dinarelli, M., Quarteroni, S., Tonelli, S., Moschitti, A., Riccardi, G.: Annotating spoken dialogs: From speech segments to dialog acts and frame semantics. In: Proceedings of SRSL 2009, the 2nd Workshop on Semantic Representation of Spoken Language, pp. 34–41, 2009 Dinarelli, M., Quarteroni, S., Tonelli, S., Moschitti, A., Riccardi, G.: Annotating spoken dialogs: From speech segments to dialog acts and frame semantics. In: Proceedings of SRSL 2009, the 2nd Workshop on Semantic Representation of Spoken Language, pp. 34–41, 2009
10.
Zurück zum Zitat Hahn, S., Dinarelli, M., Raymond, C., Lefevre, F., Lehnen, P., De Mori, R., Moschitti, A., Ney, H., Riccardi, G.: Comparing Stochastic Approaches to Spoken Language Understanding in Multiple Languages. IEEE Transactions on Audio, Speech, and Language Processing 19(6), 1569–1583 (2011)CrossRef Hahn, S., Dinarelli, M., Raymond, C., Lefevre, F., Lehnen, P., De Mori, R., Moschitti, A., Ney, H., Riccardi, G.: Comparing Stochastic Approaches to Spoken Language Understanding in Multiple Languages. IEEE Transactions on Audio, Speech, and Language Processing 19(6), 1569–1583 (2011)CrossRef
11.
Zurück zum Zitat Hajič, J., Ciaramita, M., Johansson, R., Kawahara, D., Martí, M., Màrquez, L., Meyers, A., Nivre, J., Padó, S., Štěpánek, J., Straňák, P., Surdeanu, M., Xue, N., Zhang, Y.: The conll-2009 shared task: Syntactic and semantic dependencies in multiple languages. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009): Shared Task, pp. 1–18. Boulder, Colorado, June 2009 Hajič, J., Ciaramita, M., Johansson, R., Kawahara, D., Martí, M., Màrquez, L., Meyers, A., Nivre, J., Padó, S., Štěpánek, J., Straňák, P., Surdeanu, M., Xue, N.,  Zhang, Y.: The conll-2009 shared task: Syntactic and semantic dependencies in multiple languages. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009): Shared Task, pp. 1–18. Boulder, Colorado, June 2009
12.
Zurück zum Zitat He, Y., Young, S.: Hidden vector state model for hierarchical semantic parsing. In: Proceedings of ICASSP, Hong Kong (2003) He, Y., Young, S.: Hidden vector state model for hierarchical semantic parsing. In: Proceedings of ICASSP, Hong Kong (2003)
13.
Zurück zum Zitat He, Y., Young, S.: Semantic processing using the Hidden Vector State model. Computer Speech & Language 19(1), 85–106 (2005)CrossRef He, Y., Young, S.: Semantic processing using the Hidden Vector State model. Computer Speech & Language 19(1), 85–106 (2005)CrossRef
14.
Zurück zum Zitat Henderson, J.: Inducing history representations for broad coverage statistical parsing. In: Proc. joint meeting of North American Chapter of the Association for Computational Linguistics and the Human Language Technology Conf., pp. 103–110. Edmonton, Canada (2003) Henderson, J.: Inducing history representations for broad coverage statistical parsing. In: Proc. joint meeting of North American Chapter of the Association for Computational Linguistics and the Human Language Technology Conf., pp. 103–110. Edmonton, Canada (2003)
15.
Zurück zum Zitat Henderson, J.: Semantic decoder which exploits syntactic-semantic parsing, for the towninfo task. Technical Report Deliverable 2.2, CLASSiC Project, 2009 Henderson, J.: Semantic decoder which exploits syntactic-semantic parsing, for the towninfo task. Technical Report Deliverable 2.2, CLASSiC Project, 2009
16.
Zurück zum Zitat Jurčíček, F., Gašić, M., Keizer, S., Mairesse, F., Thomson, B., Yu, K., Young, S.: Transformation-based learning for semantic parsing. In: Proceedings of Interspeech, pp. 2719–2722. ISCA, 2009 Jurčíček, F., Gašić, M., Keizer, S., Mairesse, F., Thomson, B., Yu, K., Young, S.: Transformation-based learning for semantic parsing. In: Proceedings of Interspeech, pp. 2719–2722. ISCA, 2009
17.
Zurück zum Zitat Kate, R.J., Wong, Y.W., Mooney, R.J.: Learning to transform natural to formal languages. In: Proceedings of AAAI, 2005 Kate, R.J., Wong, Y.W., Mooney, R.J.: Learning to transform natural to formal languages. In: Proceedings of AAAI, 2005
18.
Zurück zum Zitat Kate, R.J.: A dependency-based word subsequence kernel. In: Proceedings of EMNLP, 2008 Kate, R.J.: A dependency-based word subsequence kernel. In: Proceedings of EMNLP, 2008
19.
Zurück zum Zitat Mairesse, F.: Training tools and semantic decoder for the towninfo task: D2.1 (prototype). Technical Report D2.1, CLASSiC, February 2009 Mairesse, F.: Training tools and semantic decoder for the towninfo task: D2.1 (prototype). Technical Report D2.1, CLASSiC, February 2009
20.
Zurück zum Zitat Mairesse, F., Gašić, M., Jurčíček, F., Keizer, S., Thomson, B., Yu, K., Young, S.: Spoken language understanding from unaligned data using discriminative classification models. In: Proceedings of ICASSP, 2009 Mairesse, F., Gašić, M.,  Jurčíček, F., Keizer, S.,   Thomson, B.,  Yu, K.,  Young, S.: Spoken language understanding from unaligned data using discriminative classification models. In: Proceedings of ICASSP, 2009
21.
Zurück zum Zitat Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics 19(2), 313–330 (1993) Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics 19(2), 313–330 (1993)
22.
Zurück zum Zitat Merlo, P., Musillo, G.: Semantic parsing for high-precision semantic role labelling. In: Proceedings of the 20th Conference on Computational Natural Language Learning (CoNLL 2008), Manchester, UK (2008) Merlo, P., Musillo, G.: Semantic parsing for high-precision semantic role labelling. In: Proceedings of the 20th Conference on Computational Natural Language Learning (CoNLL 2008), Manchester, UK (2008)
23.
Zurück zum Zitat Meza-Ruiz, I.V., Riedel, S., Lemon, O.: Spoken Language Understanding in dialogue systems, using a 2-layer Markov Logic Network: Improving semantic accuracy. In: Proceedings of Londial, 2008 Meza-Ruiz, I.V., Riedel, S., Lemon, O.: Spoken Language Understanding in dialogue systems, using a 2-layer Markov Logic Network: Improving semantic accuracy. In: Proceedings of Londial, 2008
24.
Zurück zum Zitat Palmer, M., Gildea, D., Kingsbury, P.: The proposition bank: An annotated corpus of semantic roles. Computational Linguistics 31(1), 71–106 (2005)CrossRef Palmer, M., Gildea, D., Kingsbury, P.: The proposition bank: An annotated corpus of semantic roles. Computational Linguistics 31(1), 71–106 (2005)CrossRef
25.
Zurück zum Zitat van der Plas, L., Henderson, J., Merlo, P.: Domain adaptation with artificial data for semantic parsing of speech. In: Proceedings of the 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, pp. 125–128. Boulder, Colorado, June 2009 van der Plas, L., Henderson, J., Merlo, P.: Domain adaptation with artificial data for semantic parsing of speech. In: Proceedings of the 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, pp. 125–128. Boulder, Colorado, June 2009
26.
Zurück zum Zitat Ruppenhofer, J., Ellsworth, M., Petruck, M., Johnson, C., Scheffczyk, J.: Framenet ii: Extended theory and practice. Technical report, Berkeley, CA (2010) Ruppenhofer, J., Ellsworth, M., Petruck, M., Johnson, C., Scheffczyk, J.: Framenet ii: Extended theory and practice. Technical report, Berkeley, CA (2010)
27.
Zurück zum Zitat Surdeanu, M., Johansson, R., Meyers, A., Màrquez, L., Nivre, J.: The CoNLL-2008 shared task on joint parsing of syntactic and semantic dependencies. In: Proceedings of the 12th Conference on Computational Natural Language Learning (CoNLL-2008), 2008 Surdeanu, M., Johansson, R., Meyers, A., Màrquez, L., Nivre, J.: The CoNLL-2008 shared task on joint parsing of syntactic and semantic dependencies. In: Proceedings of the 12th Conference on Computational Natural Language Learning (CoNLL-2008), 2008
28.
Zurück zum Zitat Thomson, B., Yu, K., Gašić, M., Keizer, S., Mairesse, F., Schatzmann, J., Young, S.: Evaluating semantic-level confidence scores with multiple hypotheses. In: Proceedings of the Ninth Conference of the International Speech Communication Association (INTERSPEECH 2008), pp. 1153–1156. Brisbane, Australia (2008) Thomson, B.,  Yu, K., Gašić, M., Keizer, S.,  Mairesse, F., Schatzmann, J., Young, S.: Evaluating semantic-level confidence scores with multiple hypotheses. In: Proceedings of the Ninth Conference of the International Speech Communication Association (INTERSPEECH 2008), pp. 1153–1156. Brisbane, Australia (2008)
29.
Zurück zum Zitat Thomson, B., Gašić, M., Keizer, S., Mairesse, F., Schatzmann, J., Yu, K., Young, S.: User study of the Bayesian update of dialogue state approach to dialogue management. In: Proceedings of Interspeech, 2008 Thomson, B., Gašić, M., Keizer, S., Mairesse, F., Schatzmann, J., Yu, K., Young, S.: User study of the Bayesian update of dialogue state approach to dialogue management. In: Proceedings of Interspeech, 2008
30.
Zurück zum Zitat Ward, W.: Understanding spontaneous speech: the phoenix system. In: Proceedings of ICASSP, vol. 1, pp. 365–367, 1991 Ward, W.: Understanding spontaneous speech: the phoenix system. In: Proceedings of ICASSP, vol. 1, pp. 365–367, 1991
31.
Zurück zum Zitat Williams, J.: Applying POMDPs to Dialog Systems in the Troubleshooting Domain. In: Proceedings of HLT/NAACL Workshop on Bridging the Gap: Academic and Industrial Research in Dialog Technology Williams, J.: Applying POMDPs to Dialog Systems in the Troubleshooting Domain. In: Proceedings of HLT/NAACL Workshop on Bridging the Gap: Academic and Industrial Research in Dialog Technology
32.
Zurück zum Zitat Wu, W.-L., Lu, R.-Z., Duan, J.-Y., Liu, H., Gao, F., Chen, Y.-Q.: Spoken language understanding using weakly supervised learning. Computer Speech & Language, 24(2), 358–382 (2010)CrossRef Wu, W.-L., Lu, R.-Z., Duan, J.-Y., Liu, H., Gao, F., Chen, Y.-Q.: Spoken language understanding using weakly supervised learning. Computer Speech & Language, 24(2), 358–382 (2010)CrossRef
33.
Zurück zum Zitat Young, S.: CUED standard dialogue acts. Technical report, Cambridge University Engineering Dept., 2007 Young, S.: CUED standard dialogue acts. Technical report, Cambridge University Engineering Dept., 2007
34.
Zurück zum Zitat Zettlemoyer, L.S., Collins, M.: Online learning of relaxed CCG grammars for parsing to logical form. In: Proceedings of EMNLP-CoNLL, 2007 Zettlemoyer, L.S., Collins, M.: Online learning of relaxed CCG grammars for parsing to logical form. In: Proceedings of EMNLP-CoNLL, 2007
35.
Zurück zum Zitat Zhou, D., He, Y.: Discriminative Training of the Hidden Vector State Model for Semantic Parsing. IEEE Transactions on Knowledge and Data Engineering 21(1), 66–77 (2009)MathSciNetCrossRef Zhou, D., He, Y.: Discriminative Training of the Hidden Vector State Model for Semantic Parsing. IEEE Transactions on Knowledge and Data Engineering 21(1), 66–77 (2009)MathSciNetCrossRef
Metadaten
Titel
Data-Driven Methods for Spoken Language Understanding
verfasst von
James Henderson
Filip Jurčíček
Copyright-Jahr
2012
Verlag
Springer New York
DOI
https://doi.org/10.1007/978-1-4614-4803-7_3

Neuer Inhalt