2017 | OriginalPaper | Chapter

Using Extended Tree Kernel to Recognize Metalanguage in Text

Author: Boris A. Galitsky

Published in: Uncertainty Modeling

Publisher: Springer International Publishing

Abstract

We formulate the problem of classifying text with respect to metalanguage and language-object patterns and propose its application areas. We extend parse tree kernels from the level of individual sentences to the level of paragraphs in order to classify texts at a high level of abstraction. The method targets text classification tasks where keyword statistics are insufficient. We build a set of extended trees for a paragraph of text from the parse trees of its individual sentences. Conventional parse trees are extended across sentences based on anaphora and rhetorical structure relations between phrases in different sentences. Tree kernel learning is then applied to the extended trees to take advantage of the additional discourse-related information. We evaluate our approach in the security-related domain of design documents, that is, documents that contain a formal, well-structured presentation of how a system is built. Design documents need to be differentiated from product requirements, architectural and general design notes, templates, research results, and other types of documents, which can share the same keywords. We also evaluate classification in the literature domain, classifying text in Kafka’s novel “The Trial” as metalanguage versus the novel’s description in scholarly studies as a mixture of metalanguage and language-object.
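The idea of applying a tree kernel to trees extended across sentences can be illustrated with a small sketch. It is not the chapter's implementation: the kernel below is a generic subtree-counting convolution kernel, the `graft` helper is a hypothetical simplification of how a cross-sentence link (for example an anaphora link) might attach part of one sentence's parse tree to another, and the nested-tuple tree encoding, the function names, and the `decay` parameter are all assumptions made for illustration.

```python
from itertools import product

# Assumed encoding: a parse tree is a nested tuple (label, child, child, ...);
# a leaf word is a plain string.

def nodes(tree):
    """Yield every internal node (tuple) of the tree, root first."""
    if isinstance(tree, tuple):
        yield tree
        for child in tree[1:]:
            yield from nodes(child)

def production(node):
    """The node label followed by the labels (or words) of its direct children."""
    return (node[0],) + tuple(c[0] if isinstance(c, tuple) else c for c in node[1:])

def common_subtrees(n1, n2, decay=1.0):
    """Weighted count of subtree fragments rooted at both n1 and n2."""
    if production(n1) != production(n2):
        return 0.0
    score = decay
    for c1, c2 in zip(n1[1:], n2[1:]):
        if isinstance(c1, tuple) and isinstance(c2, tuple):
            score *= 1.0 + common_subtrees(c1, c2, decay)
    return score

def tree_kernel(t1, t2, decay=1.0):
    """Sum the shared-fragment counts over all pairs of nodes."""
    return sum(common_subtrees(a, b, decay) for a, b in product(nodes(t1), nodes(t2)))

def graft(tree, path, subtree):
    """Hypothetical extension step: return a copy of `tree` in which `subtree`
    is appended as an extra child of the node reached by `path`.
    `path` is a tuple of tuple indices, so the first child of a node is index 1."""
    if not path:
        return tree + (subtree,)
    i = path[0]
    return tree[:i] + (graft(tree[i], path[1:], subtree),) + tree[i + 1:]

# Two toy sentences; assume "it" in the second refers to "a system" in the first.
s1 = ("S", ("NP", ("DT", "the"), ("NN", "document")),
           ("VP", ("VBZ", "describes"), ("NP", ("DT", "a"), ("NN", "system"))))
s2 = ("S", ("NP", ("PRP", "it")),
           ("VP", ("VBZ", "uses"), ("NP", ("NN", "encryption"))))

# Attach the second sentence's verb phrase under the antecedent noun phrase.
extended = graft(s1, (2, 2), s2[2])
print(tree_kernel(extended, extended), tree_kernel(s1, s1))
```

Under these assumptions, the extended tree carries fragments that span both sentences, so a kernel computed on it can reward discourse-level similarity that two separate sentence-level kernels would miss. A kernel matrix built this way could be passed to any kernel-based learner that accepts precomputed kernels.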

Metadata
Title
Using Extended Tree Kernel to Recognize Metalanguage in Text
Author
Boris A. Galitsky
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-51052-1_6